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1917: 
A JOURNAL IS BORN 


JOHN G. DARLEY 


University of Minnesota 


1967 represented the Soth yr. of publication of this Journal. A reading of 10 
complete volumes on a quinquennial cycle starting in 1917 suggests that the 
classic problems of applied psychology were approximately adumbrated by 
about 1922, except for the area of biomechanics. Many of today's fashionable 
concepts, such as moderator variables, cross-validation, question format, are 
easily discernible in early volumes. The disappearance of several early tests 
as the technology improved is noticeable. Judging by bibliographic citation 
patterns, applied psychologists share psychologists’ generally cavalier attitude 
toward history. Thus this excursion into the history of 1 journal may com- 
mend itself to other amateur historians as an interesting and possibly worth- 


while didactic exercise. 


The time was March 1917. The United 
States was engaged in the “war to end war”; 
it came to be known later as World War I. 
Volume 1, Number 1, of the Journal of Ap- 
plied Psychology, Florence Chandler of Wor- 
cester, publisher, went to a subscription list 
of now unknown size. Its foreword by G. 
Stanley Hall, John Wallace Baird, and L. R. 
Geissler described its goals: 


The past few years have witnessed an unprecedented 
interest in the extension of the application of psy- 
chology to various fields of human activity... . 
Thence the attention of the applied psychologist 
turned to the more comprehensive and fundamental 
problem of vocational selection... and it must 
appeal to every human being who is interested in 
increasing human efficiency and human happiness by 
the more direct method of decreasing the number of 
cases where a square peg is condemned to a life of 
fruitless endeavor to fit itself comfortably into a 
round hole . .. none of the existing journals devote 
themselves to the task of gathering together the 
results of workers in the various fields of applied 
Psychology, or of bringing these results into relation 
with pure psychology. . .. In view of all these 
circumstances the undersigned have decided, not 
without hesitation, to launch a Journal of Applied 
Psychology,—their hesitation being of financial ori- 
gin because they have no pecuniary resources in 
pres or in prospect, excepting their own... [pp. 
fils 


1 


The masthead identified the cooperating 
editors; these were the first of many psycholo- 
gists who served effectively in this anonymous 
but crucial gate-keeping role: W. V. Bing- 
ham, June E. Downey, Robert H. Gault, Dr. 
William Healy, H. L. Hollingworth, Joseph 
Jastrow, Herbert S. Langfeld, James H. 
Leuba, Henry C. McComas, Eleanor Harris 
Rowland, Walter D. Scott, C. E. Seashore, 
Daniel Starch, Edward K. Strong, Jr., L. M. 
Terman, E. L. Thorndike, J. E. Wallace Wal- 
lin, G. M. Whipple, Helen Thompson Wool- 
ley. The masthead goes on further to say: 


The field of the Journat will include the follow- 
ing: 

(a) The application of psychology to vocational 
activities, such as law, art, public speaking, indus- 
trial and commercial work, and problems of business 
appeal. 

(b) Studies of individual mentalities, such as types 
of character, special talents, genius, and individual 
differences, including the problems of mental diag- 
nosis and vocational prognosis. 

(c) The influence of general environmental condi- 
tions, such as climate, weather, humidity, tempera- 
ture; also such conditions as nutrition, fatigue, etc. 

(d) The psychology of everyday activities, such as 
reading, writing, speaking, singing, playing games or 
musical instruments, sports, and the like. 

In reply to several inquiries the editors wish to 
state that those mental tests which aim to deal with 
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purely educational problems or abnormal minds do 
not as a rule fall within the scope of the JOURNAL 
(pp. 1-2). 


The contributors were expected to pay cer- 
tain costs but “as soon as the JOURNAL be- 
comes self-supporting, or as soon as the pres- 
ent excessive prices of paper and printing 
abate, the above terms will be made more 
liberal (p. 2].” The record indicates that the 
excessive prices did not abate. 

In his autobiography, Hall (1923) makes 
only the most casual reference to this new 
Journal. Writing about the evolution of the 
testing movement as an outgrowth and exten- 
sion of one of his major loves—child study— 
he says: 


It is in the great stress now laid upon these tests, 
too, that psychology in general has enormously in- 
creased its utility, but not without grave danger of 
lapsing from culture to Kultur, since the later meth- 
ods seek practical results with individuals and have 
added little to our knowledge of child nature in 
general. In this latter direction, however, Clark Uni- 
versity has made it a point for the last decade or 
more to have some competent authority to apply 
tests gratis to all children from the city sent or 
brought to it, and in recent years Dr. L. R. Geissler 
gave his entire time to making such tests and in- 
structing others to do so here, and we founded here 
the Journal of Applied Psychology, an organ for the 
development of this work which, as I write, is in its 
sixth volume [p. 399]. 


This is the only mention of Geissler in the 
-autobiography; there is no mention whatever 
of Baird, the other coeditor. Baird had been 
at Clark since 1910; he had been in charge 
of the laboratory there and in 1918 served as 
president of the American Psychological As- 
sociation. He died in 1919, having served as 
coeditor for only 2 years. 

Hall was far more enamored of the Ameri- 
can Journal of Psychology, which he founded 
in 1889, and the Pedagogical Seminary, which 
he founded in 1893. In another place in his 
autobiography, he speaks more harshly of 
psychometrics: 


We have no method of testing testers, whose compe- 
tence varies vastly from mere mechanical accumula- 
tion of data and the busy work of computing corre- 
lations up to real insight and sagacity. . . . I would 
cast no discredit upon this work, much as I deplore 
its present monopoly of psychological endeavor, but 
to my mind no one has made it sufficiently clear 
just what even the most used of them really tests, 
and still less justified for either psychology or for 
life the value of the powers they activate. All of 
them together have done a great work in applying 
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psychology to life and industry but have added 
scarcely a scintilla to our knowledge of the human 
soul [p. 450]. 


Editors may propose, but authors and the 
availability of manuscripts dispose. There 
were 35 articles, 37 authors, and 399 pages 
in Volume 1. The volume also included book 
reviews and notices, brief notes and summa- 
ries, and a section on psychology and mili- 
tary affairs covering the deployment of psy- 
chologists, the work of special committees, 
and other speeial activities related to World 
War I, signed by Robert M. Yerkes, Major, 
Sanitary Corps, and Chairman Psychology 
Committee, National Research Council. 

Using the editors’? four fields for the new 
Journal, these 35 articles are classified in the 
1917 column of Table 2. It is evident that 
contributions were accepted under less than 
rigorous criteria in the first year. The essays 
ranged from Hall’s own APA twenty-fifth an- 
niversary speech on psychology and the war 
through two philosophic articles by A. A. 
Roback, in one of which he attacks Munster- 
berg’s avoidance of moral and ethical issues 
in applied psychology. The doubtful felicity 
of this critique lies in the fact that William 
Stern’s memorial tribute on the occasion of 
Munsterberg’s death is found in the same 
volume. Other essays deal with applications 
of psychology in a wide range of endeavor— 
with its utility to business men, its promise 
of solutions to the problems of labor turnover, 
its value in increasing the efficiency of work- 
ers. In the main, these are characterized by a 
high moral tone, by hope rather than evi- 
dence, and by a very conservative economic 
outlook. . 

By today’s standards, statistical analyses 
were primitive and often missing. A few arti- 
cles, however, are sophisticated in design, 
analysis, or presentation. Baird’s study of the 
legibility of telephone directories in various 
type faces; Burtt’s analyses of Munsterberg’s 
tests applied to various samples; Miner’s cor- 
relational study of rating methods; Pintner 
and Toops’ test analyses of unemployed men; 
Scott's presentation of test results of appli- 
cants versus successfully employed as a vali- 
dating method; Terman’s use of psychological 
tests in selecting policemen and firemen; 
Thorndike’s study of decrements in work out- 
put and work satisfyingness; Yerkes’ com- 
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parison of Binet and point scale measures of 
intelligence—each of these is a delight to read. 
Wallin, in a long case history of a stutterer, 
takes a direct slap at Freudian theory: 


But to maintain that all the words in a custom- 
made list which are associated with delayed re- 
sponses reveal an inner conflict or emotional com- 
plex, may suffice for the unsophisticated devotee of 
a cult but surely cannot satisfy the critical experi- 
mental scientist who demands facts capable of ob- 
jective verification [pp. 358-359). 


Garrison, Burke, and Hollingworth present a 
case report on a superior child first tested at 
8 years and 4 months of age, with a Stanford- 
Binet of 187; follow-up reports on this child 
appear with pleasant regularity in Volumes 6 
and 11 of the Journal. 

And so the new journal was launched. It 
took its place in brilliant company: Psycho- 
logical Review was, in 1917, 23 years old and 
its monograph supplements first appeared in 
1896; the Journal of Experimental Psychology 
had been established in 1916; the Psycho- 
logical Bulletin was 13 years old; the Psycho- 
logical Index had been in existence since 
1894; the Journal of Abnormal Psychology 
first appeared in 1906; the Journal of Educa- 
tional Psychology was first published in 1910; 
Psychological Monographs first appeared in 
1910; G. Stanley Hall's first love, the Ameri- 
can Journal of Psychology, started publica- 
tion in 1889, the year in which Hall opened, 
as president, the doors of Clark University. 

To see how the journal evolved, I have read 
Volumes 1, 6, 11, 16, 21, 26, 31, 36, 41, and 
46 in their entirety, thus sampling almost a 
half-century of the Journal’s life. Some of 
the material I have tabled. (See Tables 1, 2, 
and 3.) At other points I have commented on 
trends or items of interest—to me at least.* 

The chronology of editors and ownership 
deserves brief comment. In 1921, James P. 
Porter, Clark University, and William F. 
Book, Indiana University, appeared as editors. 
Hall had apparently sold the Journal in 1920 
or 1921; his ardor for it, certainly, had cooled. 
Porter left Clark to go to Ohio University; in 
1922 his new affiliation first appeared. By 
1928, Porter was the sole editor; Book ap- 


1 The structure of this review owes much to David 
Cohn’s (1940) The Good Old Days, a fascinating and 
exciting treatment of our social history. I regret my 
lack of time and skill to duplicate Cohn’s analysis. 


TABLE 1 


Numeers or Articies, Packs, Axo Autiiors 
IN SELECTED VOLUMES or THE JOURNAL 
or AppLIED PsycHoLocy 


No. 
Year Volume No. No. No, early 
| number | articles | pages authors publi- 
| | cations 
1917 | 1 35 399 37 j — 
1922 6 33 433 37 — 
1927 1 S44 51 = 
1932 16 | 68 717 81 * 
1937 21889 735 66 <7 
1042 26 81 887 102 — 
1947 31 | 80 664 96 27 
1952 36 97 433 132 21 
1957 41 89 418 131 4 
1962 46 85 432 141 7 


Note. — Book reviews, notes and news, materials received 
appeared through Volume 39. Contemporary Psychology, estab- 
lished in 1956, removed these pages from the Journal of Applied 
Psychology. 


parently withdrew in 1927. The Psychological 
Bulletin for 1943 (Volume 40, page 652), re- 
porting the annual business meeting of the 
APA, noted that the Journal was purchased 
from James Porter that year. Volume 27, 
1943, was edited by Donald G. Paterson and 
published by the APA with the cooperation 
of the American Association for Applied Psy- 
chology. In 1955, Volume 39, I started a 6- 
year term as editor. In 1961, Volume 45, Ken- 
neth E. Clark became the editor and con- 
tinues today in that position. Porter’s tenure 
lasted 22 years; Paterson held the editorship 
for 12 years. The years of the Journal’s ex- 
istence have been the years of psychology’s 
most rapid growth. New journals have come 
into being in the area of counseling psychol- 
ogy, in the areas of psychometrics and test- 
ing, the area of human engineering, and the 
area of personnel management, to mention 
some of the fields for which the Journal of 
Applied Psychology has provided a publica- 
tion outlet. 

Let us go back, now, to our quinquennial 
sampling of the volumes. Volume 6 covers the 
year 1922. There were 33 articles, 433 pages, 
and 37 authors. The number of dataless es- 
says has dropped to two; four articles that 
must be classified as statistical or methodo- 
logical now appear: Toops described a method 
of computing a matrix of intercorrelations by 
using an adding machine; Hull described a 


Joux G. 


Deter 


TABLE 2 
Peacentaces oF Anticixs tN Vamos Canan oF Ps 


0 
0 
2 
3 
2 
5 
10 
26 
27 


„Teer lar Immediately evident in Volume 1 that the editory inal four fields were i 
— had Additions 


be introduced in the 


Psychology 5 Statistical 
ofeveryday| Essays A methodo- 
activities studies | logical 

6 36 9 3 

3 6 3 12 

13 4 7 7 

13 0 2 10 

20 7 3 7 

12 5 4 14 

9 7 0 14 

7 8 0 7 
10 0 0 6 
1 0 0 13 


insufficient ; thus these other categories 
ly, I have no evidence on the reliability of interjudge con- 


t scheme. Addi 
22 ee categorien, since 1 was the sole Judge of each article's place in the scheme. 


method of deriving sigma scores, and a method 
of computing Pearsonian r's from ranked 
data; Thorndike discussed methods of deriv- 
ing equivalent scores on intelligence tests of 
varying degrees of correlation, means, and 
dispersals; Chapman discussed the problem of 
known and guessed scores in true-false exami- 
nations. 

Ream, in an article describing the develop- 
ment of a test to measure Thorndike’s “social 
intelligence,” mentions socially acceptable 
and unacceptable items, foreshadowing the 
later spate of literature on item format. Pof- 
fenberger concludes an article on setting cut- 
ting scores with the following statement: 


From the psychologist’s point of view it might not 
be difficult to decide upon a critical intelligence score 
for the purposes of this school.. . . But here as in 


most institutions where tests are being established 
the keenest interest centers upon the individual case. 
If one or a few good students be lost, the test may 
thereby be condemned. “If a man have a hundred 
sheep and one of them be gone astray, doth he 
not leave the ninety and nine, and goeth into the 
mountains, and seeketh that which is gone astray? 
And if so be that he finds it, . he rejoiceth more 
of that sheep than of the ninety and nine which 
went not astray.” [p. 160] 


So much for selection ratios, institutional de- 
cisions, and predictive validity. Poffenberger, 
in another article, uses peer ratings as an 
independent variable in predicting the cri- 
terion of expert ranks of letters of applica- 
tion. Max Freyd introduces the methods later 
used by Strong in the measurement of voca- 
tional interests. Root, in one of the two es- 
says, gives a sophisticated discussion of the 


TABLE 3 
PERCENTAGES OF AUTHORS IN VARIOUS TYPES OF EMPLOYMENT 


Year | Volume | Consulting | University | Military | State/city | Federal Private Not 
number | orindustry or college | service service service practice known 

1917 1 14 66 0 8 0 6 6 

0 21 0 0 3 

0 11 0 0 6 

0 15 3 3 6 

0 15 0 0 0 

4 13 0 0 4 

4 2 1 1 0 

5 2 2 2 0 

20 1 1 0 1 

20 1 2 0 0 
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clinical versus the statistical problems under- 
lying the battle over the constancy of the IQ. 
Goldsmith reports the complete standardiza- 
tion of a weighted application blank for use 
in predicting success of life insurance sales- 
men. Naccarati and Lewy-Guinzberg, writing 
on the relation between hormones and intelli- 
gence, display various correlations between 
height-weight ratios and intelligence test 
scores, explaining away the low correlations 
by sampling errors that led to the inclusion 
of too many normosplanchnics in the student 
samples, at the expense of microsplanchnic 
and macrosplanchnics, J. McKeen Cattell an- 
nounces the availability of shares of stock in 
the newly organized Psychological Corpora- 
tion at $10 per share. Knight shows an amaz- 
ing mean difference between teacher volun- 
teers who signed their names and those who 
did not in scores on a scholastic ability test 
in a program introducing them to the idea of 
mental testing. Viteles compares the inter- 
correlations and differential predictive power 
of three different intelligence tests, using col- 
lege grades as criterion measures. His findings 
indicate that tests of presumably the same 
functional domain do not show high inter- 
correlations or equal predictive power. One 
gets the feeling, from Volume 6, that quite 
a few of the classic problems of applied psy- 
chology had been opened up fairly early. 

In Volume 11, 1927, three methodological 
articles discuss partial correlations, weight- 
ing and scaling methods for mental tests, and 
theorems concerning the arithmetic mean and 
standard deviation. Kornhauser, standardiz- 
ing a questionnaire to differentiate between 
good and poor students at the University of 
Chicago, clearly describes the procedure of 
cross-validation, although he does not so 
name it: 


The unreliability of the questionnaire results became 
further apparent through the following comparison. 
In order to check the results obtained from our 
sample of 45 questionnaires, another sample of 30 
... was taken. Of the 30 questions which gave 
the best differentiation in the first sample, only 
eight gave even fairly good results in the new sam- 
ple. The correlation of total scores on the 30 items, 
with average marks, was 0.20 in this group of 30 
people, as contrasted with the 0.73 in the original 
sample Ip. 93]. 


Reporting on two case histories of a pair of 
sisters placed in an orphanage setting, Flor- 
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day a goodly part of the world’s deliberations 


of intelligence, and time available for com- 
mittee activities. Considering that these ques- 
tions are still not clearly understood by our 
colleagues in social psychology, it is interest- 


occasionally in Volume 11. In a job analysis 
of athletes, Charles Homer Bean writes: 


Educators seem to be no longer in doubt about the 

value of games and other athletic sports, not 

for health and for the consumption of surplus en 

ergy that would otherwise bother the teacher, but 

also for the education of both body and mind. The 

boy or girl who enjoys few, or no opportunities for 
athletic 


static, the various psychoneural hook-ups that are 
as all-important to clear thinking as to the efficient 
converting of thought into action [p. 369]. 


The notes and news columns of Volume 11 
contain the following two items: 


“Motion picture theaters are too dark,” Guy A. 
Henry, of New York, General-Director of the Eye 
Sight Conservation Council of America, declared in 
a recent statement. “People are unnecessarily sub- 
jected to eye strain,” he asserted, “in poorly lighted 
auditoriums.” Investigations reveal that managers 
of motion picture theaters have no method of 
determining the effectiveness of the lighting, said 
Mr. Henry, who urged the framing of a special code 
of illumination for motion picture auditoriums, fol- 
lowing a scientific study of the problem. He be- 
lives that this is too important a matter to be left 
to the judgment of individuals [pp. 165-166]. 


Defective eyesight contributes to juvenile delin- 
quency, according to Guy A. Henry, General-Direc- 
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tor of the Eye Sight Comervation Council of Amer- 
ica. Child Health Day, general observance of which 
has been asked in proclamations by mayors of 
cities and other official: throughout the country, 
should be made an event of moral and physical sig- 
nificance In every household where there are chil- 
dren 

Investigation by the Eye Sight Council has shown 
that “bad eyes make bad boys,” and that the ex- 
perience of Juvenile Courts prove that defective 
vision makes children truants. Each year more than 
200,000 children come before these courts, and eye 
conservation In the home and In the schools should 
be employed as a factor In checking criminal tend- 
encies. 

“Juvenile Courts are finding that bad eyesight 
leads to inattention in school, to unfair competition, 
and to disrespect for authority, Bad eyes lead to 
truancy and the truant child is a criminal in em- 
bryo.” Ip. 319) 


The editorial standards for notes and news 


were, apparently, not stringent. 

Volume 16 showed noticeable increases in 
total pages, number of articles, and number 
of authors. However the proportions of arti- 
cles in various fields of applied psychology 
did not change markedly. 

Two items from the Notes and News sec- 
tions of Volume 16 are interesting: 


According to a recent report by the New York 
Commission on Ventilation there are still twenty 
states in the Union which retain statutes concerning 
ventilating devices which are based on disproved 
or antiquated theories. Dr. C.-E. A. Winslow, chair- 
man of the Commission, estimates that an unneces- 
sary expenditure of nearly $2,500,000 annually is 
made in an attempt to conform to obsolete regu- 
lations requiring the installation and maintenance in 
school buildings of a system of mechanical venti- 
lation which science has proved to be not only 
unnecessary but even a menace to health [p. 97]. 


Syracuse University announces the establishment of 
an independent department of psychology, with Dr. 
Ernst Thelin as head of the department. Until the 
present time, psychology and philosophy have been 
operating as a single department at Syracuse, de- 
spite the fact that in nearly all other universities 
they have established separate organizations. At 
Syracuse such a change was deferred, owing to the 
satisfactory operation of both these branches of 
teaching under the head of the Department of Phi- 
losophy. But in recent years psychology has grown 
at Syracuse from a simple department in Philosophy 
to a group including 16 sections of Introductory 
Psychology and 18 advanced courses, with a total 
enrollment of 1,000 students Ip. 98]. 


The only case study reported in Volume 16 
is S. H. Britt’s “Examination of Miss Gene 
Dennis, ‘Psychic’.” Miss Dennis was appar- 
ently laying them in the aisles in St. Louis in 


her theatrical performances as “a real seer 
p. 691].” Dr. Britt, having examined her at 
her own request, states: 


The experiment seems to justify the following con- 
clusions with regard to Miss Dennis: 

1. In terms of IQ, she was slightly below average 

2. In the solution of a learning problem, she was 
considerably less efficient than the average college 
student. 

3. As to “mind-reading” ability, she failed to dem- 
onstrate any. 

4. In no part of the experiment was there evi- 
dence of “psychic” powers [p. 694]. 


There is no evidence of the effect of these 
conclusions on Miss Dennis’ subsequent ca- 
reer. 

Volume 16 carries an excellent study of 
faking of vocational interest test scores by H. 
C. Steinmetz; among his suggestions: “Further 
experimentation of this sort with personality 
tests is suggested for measurement of social 
insight and the extent to which purpose may 
distort test results [p. 130].” 

An interesting observation in Volume 16 
on something akin to socialized medicine is 
found in the following introduction to an 
article by Keller and Weber: 


At present all persons seeking to prepare for the 
dental profession in Germany must take and suc- 
cessfully pass a battery of searching tests. Before 
proceeding to an account of the tests it will be well 
to say something regarding the training and status 
of the dentist in Germany. The dentist is not to be 
confused with his academic competitor, the Zahn- 
arzt. The Zahnarzt is fully educated in the academic 
sense and holds a full medical degree. He is a medi- 
cal doctor, perhaps also a surgeon, who has ‘special- 
ized in diseases and troubles of the teeth. The den- 
tist on the other hand is a practical though perhaps 
highly skilled technician. We are speaking here of 
the Staatlichgeprueften Dentisten who form the Na- 
tional Association of Dentists (Reichsverband 
Deutscher Dentisten) with head offices in Berlin. 
Among others these dentists are permitted to take 
as patients the large number of people who are 
insured in the Krankenkasse, one of the great social 
institutions of Germany. Certain classes of people, 
particularly laborers, are required to pay small fees 
to the Krankenkasse as a premium for medical serv- 
ices to which they are entitled when need arises. 
This system has been found successful in combating 
the tendency which is world-wide to neglect ills— 
naturally one takes advantage of medical services 
for which one has already paid [p. 465]. 


Their evidence on validation is given briefly: 
“Tt has never occurred that a candidate who 
was found ‘suitable’ in the test was after- 
wards found unsuitable in college work, ex- 
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cept in one case in which, bowever, a psy- 
chosis was involved [p. 474].“ 

Volume 16 seems to have a report on the 
first use of subliminal stimulation affecting 
consumer behavior, Entitled “How the Con- 
sumer Estimates Quality by Subconscious Sen- 
sory Impressions,” the article describes a field 
test of four identical pairs of women's stock- 
ings treated with traces of four different 
scents. The author blandly quotes from a 
local newspaper the reason for discontinuing 
the field work: 


of 
ings which were perfumed and 
ings that were without the artificial odor and their 
errand was to learn whether the 
consider the perfumed article mo: 
those without perfume Ip 245], 


Laird's own conclusions: 


Two things are plainly indicated by this experiment: 
sensory impressions re- 


in determining the housewife's judgment of quality 
in silk hosiery. 2. Scents of one type are more influ- 
ential in determining this judgment of quality than 
are scents of another type Ip. 246]. 


Applications of psychophysical methods 
are not missing in Volume 16. A modest study 
by G. R. Wendt, then of the Yale Institute 
of Human Relations, presents a modest sum- 
mary: 


i 


when its pattern is up of larger units. The 
amount of this apparent brightness difference was 
measured. In a second application of the same 
method it was found that the character of cover 
markings of golf balls has no effect on their apparent 
size, 


the psychology of advertising. Except for an occa- 
sional study of consumers’ preferences little measure- 
ment of psychological factors in industrial products 
has been attempted. The field is one which recom- 
mends itself not only because of its range of possi- 
bilities but also because of the relative ease with 
which its problems can be solved [p. 276]. 


Volume 21 finally contains an article clearly 


strokes seeded to play a hole is a straight line func- 
tion of the length of the bole ... The average 
effective length of the drive of the composite tourna~ 
ment winner is 228.96 yards. . . . The effective length 
of the full second shot as played by the tournament 
winners is 228.96 yards... . The humanly perfect 
shot h one having a deviation of approximately 5° 
on both sides of the intended line of Might of the 
hall Ip. 184). 


These “laws” are cold comfort to weekend 
golfers. 

Three articles in this volume deal with 
automobile driving and traffic problems, the 
first such articles in this sampling of the 
Journal. Andrew presents the first factor 
analysis seen in this quinquennial review, in 
her report on the standardization of the Min- 
nesota Vocational Test for Clerical Workers. 

In a study of the job satisfaction of psy- 
chologists, Hoppock sampled the APA mem- 
bership in vocational or industrial psychology; 
out of 203 job satisfaction questionnaires 
mailed, 66 psychologists responded. “During 
the four years from 1932 to 1936 the average 
salary had decreased $157 a year to a 1936 
average of $3,261 [p. 301].” The good old 
days! 

The formation of the new American Asso- 
ciation for Applied Psychology is mentioned 
in the Notes and News columns and in a 
more extended article by Fryer, following 
upon the APA meeting in Minneapolis in 
1937. 

Another excellent example of cross-valida- 
tion, though not so called, appears in the arti- 
cle by Helen Messenger entitled: “An Tllus- 
tration of New Techniques in Test Construc- 
tion [p. 311].” A little-known and, I must 
admit, long-forgotten article by Darley (page 
485) bears a fair resemblance to what later 
came to be called studies of moderator varia- 
bles; a similar study of moderator variables, 
also affecting the prediction of academic suc- 
cess, is presented in a study by Wolf (page 
304). 

Volume 26, 1942, marked the last year of 
Porter’s editorship. It contained 81 articles 
and 887 pages. The first appearance of the 
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analysis of variance, in this quinquennial re- 
view, is found in two articles; one, by Brown, 
Fisk, and Torkelson, reports on recovery from 
glare blindness under conditions of changes in 
dietary intake of Vitamin A; the other, by 
Jackson, Jerome, and Schoenfeld, reports a 
study of the effectiveness of deodorant creams! 

World War II makes its presence felt in 
Volume 26 in the Notes and News columns; 
in one article on Naval recruit testing by 
Lewinski (page 316); in an essay by English 
on the applications of psychology to problems 
of military training (page 3); in Link’s re- 
port of attitude surveys of workers in a war 
economy; in Ruch and Young’s analysis of 
the effectiveness of Axis propaganda; and in 
an early and favorable book review of Tol- 
man’s Drives Toward War. Psychology’s re- 
sponse to World War II reached journal pub- 
lication somewhat more rapidly than was the 
case in World War I. 

Another excellent example of cross-valida- 
tion is found in the article by Ryan and 
Johnson (page 543) on the use of interest 
measures in the selection of salesmen and 
servicemen. 

Volume 31, 1947, is the first volume sam- 
pled here under Paterson’s editorship. As 
Table 1 will show, 27 articles (or 34% of all 
articles in the volume) represented early pub- 
lication paid for by the authors. One gets the 
feeling the option was somewhat overused. 

The first extended discussion of the field of 
human engineering or biomechanics appears 
in Volume 31; Dunlap’s presidential address 
to the Division of Consulting Psychology at 
the Denver APA meetings (1947) describes 
the wartime emergence of the field and pre- 
dicts additional areas of application in forth- 
coming years. As will be seen in Table 2 , the 
little used category of “influence of environ- 
mental conditions,” in Volume 31 and subse- 
quent volumes under review, becomes heavily 
loaded with human engineering articles, 

Clark, in the midst of many favorable re- 
ports of military applications of psychology 
in these volume years, takes from Navy re- 
ports examples of projects that were of lim- 
ited value or were total failures in contribu- 
ting to military efficiency (page 94). This 
article will be required reading at the start of 
World War III. 

Volume 36, 1952, gives a dramatic illustra- 
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tion of editorial power; as Table 1 indicates, 
it contained 97 articles, the largest number in 
the volumes reviewed here but packed within 
only 433 pages that also included book re- 
views and other materials. Since the propor- 
tion of these articles in the various areas of 
applied psychology did not differ markedly 
from other years, readers were merely exposed 
to more of the same diet, with the notable ex- 
ception of a decrease in articles about adver- 
tising research. One of these articles (Twedt, 
page 207), however, applies factor-analytic 
methods of mechanical and content variables 
in ads as predictors of a readership criterion; 
it is the most effective theoretical study of 
advertising to be found in the volumes re- 
viewed. 

A pleasant air of controversy appears in 
Volume 36; Eells and Darley exchanged 
rather brisk comments on Darley’s unfavora- 
ble review of Eell’s coauthored volume on in- 
telligence and cultural differences; Farr, Jen- 
kins, Paterson, and England exchange bland 
rebuttals with Klare and with Flesch regard- 
ing their earlier article on the simplification 
of Flesch’s reading ease formulas. Bernard 
criticizes Rock's ‘Sales Situation Test”; 
Rock answers Bernard’s criticism, and Pater- 
son replies to Bernard’s passing comment 
that he shouldn’t have published Rock’s origi- 
nal article in the first place! With regard to 
industrial applications of psychology, this 
volume contains examples of more sophisti- 
cated studies of industrial organizations, as 
distinct from prediction studies of perform- 
ance of personnel. An article by C. G. Brown 
and B. J. Neitzel, entitled “Communication, 
Supervision, and Morale,” illustrates this 
shift in emphasis (page 86). 

In Volume 41, there is a noticeable increase 
in the number of articles classified as human 
engineering (in the area, “influence of the 
environment”). I was serving my third year 
as editor; APA policies had set an allowance 
of approximately 400 pages per volume year 
for this Journal. I seemed to have continued 
Paterson’s emphasis on short articles, or at 
least contributors had continued to submit 
shorter articles after Paterson’s terms as edi- 
tor. A somewhat increased level of sophistica- 
tion in choice of problems and applications of 
methods seems to appear in this volume, but I 
hasten to disclaim any editorial credit for this; 


j 


1917: A JOURNAL 1s Born 9 


sooner or later the field simply ad to become 
more sophisticated! 

By Volume 46, 1962, Kenneth E. Clark 
had assumed the editorship of the Journal. In 
Volume 41 and again in Volume 46, an in- 
crease in the proportions of authors from 
other than academic employment may be seen, 
reflecting in part the changing nature of em- 
ployment patterns in psychology. (See Table 
3.) 

My colleagues in history find themselves ill 
at ease in the years of the present; they can 
neither identify trends nor evaluate factors 
producing change. I find that I share their 
malaise as I move to the present years of the 
Journal oj Applied Psychology. It would be 
well, therefore, to draw the curtain of this 
review with Volume 46, 1962. 

But what of the past and what, in broad 
outline, can be learned from it? Several gen- 
eral observations come to mind as I return 
these 10 dusty volumes to their dusty shelves. 

Plus ça change, plus c'est la même chose. 
There is obviously no domain of knowledge 
or unitary field of psychology that can be 
subsumed under the word “applied”; rather 
there are many ways and places to apply psy- 
chology. In large part, these were nearly all 
foreshadowed in the early volumes of this 
Journal. Today’s concepts of cross-validation, 
moderator variables, and question format, for- 
example, are easily discernible in early arti- 
cles. As Xhignesse and Osgood (1967) have 
recently pointed out in studying the time di- 
mension of journal citations, 


. . . the time perspective of psychology is short and 
getting shorter. If this is true, then it would seem 
that psychology as a science is in danger of “for- 
getting where it has been,” of repeatedly rediscover- 
ing facts and theories that have been well worked 
in the past [p. 790]. 


Certainly applied psychologists seem to have 
a cavalier attitude toward their own history. 
And, as Santayana pointed out, those who for- 
get their own history seem doomed to relive 
it; there is much that is repetitious in the 
enterprise. 

At the same time, there was and still is an 
air of excitement and discovery as one follows 
the Journal of Applied Psychology over time, 
seeing on occasion a new application, an inter- 
esting application, or an exciting methodologi- 


cal solution to a persistent problem in human 
affairs. That such articles are rare or infre- 
quent is not, I submit, characteristic of this 
Journal alone; the truly seminal or important 
articles in any area of psychology are infre- 
quent events. 

Another question: where are the tests of 
yesteryear? Many tests appear in early arti- 
cles that by now have disappeared into well- 
deserved oblivion, as the psychotechnology 
and standards of test construction have be- 
come more sophisticated and rigorous. Hall 
did not foresee this, nor did he foresee the 
contribution that test developments and sta- 
tistics would make to our understanding of 
the organization of behavior. The Journal of 
Applied Psychology has always been an out- 
let for this area, for strong and weak articles. 

In the main, authors in this Journal have 
been writing about performance differences 
related to age, sex, race, special group mem- 
bership, or (rarely) special conditions set by 
an experimenter. These studies, in Cronbach’s 
(1957) definition of the two sciences of 
psychology, were concerned with events that 
nature herself had defined and created; they 
also, in more modern terms, dealt both with 
construct validity and predictive validity. 
In any new science, many such studies are 
needed to clear away the underbrush of “com- 
mon sense,” to test assumptions of causality 
or covariation, to develop methods and experi- 
mental expertise, and to open up new areas 
of study in the field and in the laboratory. 
The pages of the Journal of Applied Psychol- 
ogy reflect this evolution, with both its 
strength and its weaknesses, its redundancy, 
and its infrequent points of high discovery. 
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LATERAL VISUAL FIELD AS RELATED TO AGE AND SEX 
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From 1961 through 1966, the author di- &. bead on the axis ne of the ar The 
7 cale r arch 4 . targets are approximately 69 cm. from the center 

rected a large i a poe * of the light source. A black cloth h suspended from 
galing the relationships between vision and ihe lower edge of the arc to screen out E's arm 
driving. The major conclusions from this study movements when he is manipulating the target 
are presented in a recent report (Burg, plunger 
1967b). Three testing units were in simultaneous operation 

Apart from data relating Lat to driving during the course of the data-collection phase of 
the sudy produced a very great amount of *Target illumination is 7.5 fte. measured per 
information concerning visual performance as pendicular to the light source-to-target axis, When 

ng p 

a function of age, sex, and other variables. measured perpendicular to the subject-to-target line 
The present paper is one of a series of ts of sight, target illumination is approximately 2 ftc 
(including Burg, 1966 and 1967a) presenting 
normative data on vision that should be of 
interest to others working in this area. 


Metnop 
Apparatus 


Gepreming a spring-loa 


arc, 
soon as the plunger is released. Each target subtends 
approximately 46 min. of arc. At each end of the 
arc is a sigh’ device for obtaining the correct eye 
level and 30 cm. distance from the cornea to the 
fixation point (a mm. white target in the center 


monocular testing. The arc is evenly illuminated by 
., 120-¥, incandescent bulb mounted above 


a 0-8, * 1 

Data collection and analysis costs were partially : 
borne by the United States Public Health Service i 
(Grant AC-0001S), and the State of California and | 28 98 
United States Bureau of Public Roads (California - — 3 
Standard Agreement 13600). 


Fic. 1, Lateral visual field screening apparatus. 
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Fic. 5, Right temporal visual field by age and sex. Fic. 7. Right-eye visual field by age and sex. 
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valid perimeter scores (e.g., they failed to 
maintain steady fixation upon the central 
target), or because they were blind in one 
eye. (Because of their special nature, one- 
eyed Ss are being analyzed separately, and 
will be the subject of a future report.) 

Inconsistencies among the cell sample sizes 
evident in Table 1 are due to the fact that 
different computer runs provided data for dif- 
ferent sections of the table, and a few Ss had 
invalid scores for some, but not all, of the 
variables under consideration, 

Upon inspection of the above table and 
graphs, the following general statements may 
be made: 


1. Field versus Age: 


(a) For both sexes, total visual field and 
both temporal fields are at their maximums 
from age 16 up to about age 35, after which 
point field decreases progressively with ad- 
vancing age. 

(5) For both sexes, nasal fields and (there- 
fore) eye fields increase up to a maximum 


TABLE 1 
SUMMARY oF MEAN VISUAL FIELD Scores BY AGE AND SEX 
Left nasal | Right nasal | Left tem- Right tem- Left-eye Right-eye ; 

field field poral field | poral field | visual field | visual fiela | Total visual feld 

ig Sex (in degrees) | (in degrees) | (in degrees) | (in degrees) | (in degrees) | (in degrees) gree: 
REO wae EN fox | N X | N z N Soper N | sp 
16-19 M | 1181| 54.9| 1181 55.1| 1181 87.8 1181| 87.0 | 1181 | 142.4] 1181 | 142.1] 1182] 174.5] 9.1 
F 732 | 58.3 | 732 38.3 732 88.7 732| 87.8 | 732 1470 732 146.1 32 174 5.5 
20-24 M 1380 56.2 | 1381 56,0 1380 87.8 1381 87.3 | 1380 1440 1381 143.3 1379 | 175.2) 7.7 
F 775 58.5 776 58.9 775|884| 776 880| 775 14% 77 14% 776/176.1| 8.8 
25-29 M 1159 36.5 1159 56.8 1159 87.6| 1159] 87.2 | 1159 | 144.1 | 1159 144.0] 1158] 174.9] 7.5 
F 604 | 58.7 | 664 5% 664 88.3 664| 87.7 | 664 147.0] 664 14% 665| 176.0] 9.0 
30-34 M | 1118] 57.2} 1117 57.3] 1118 87.2| 1117] 87.3 | 1118] 1444] 1117] 144, 1116] 174.6] 7,7 
F 629 50.3 629 5% 629] 88.0 629] 87:7 629 147.3] 62% 147.2] 631 175.0] 7.2 
35-39 M | 1118| 56.7| 1114 572 1118] 86.9] 1114] 86.4 | 1118] 143.6] 1114 | 143.6] 1114] 173.4] 9.8 
F 691 59.5 602 50.0 691 87.9] 692| 87.4 | 691] 1474] 692] 147.4] 690] 175.1 6.7 
40-44 M 1121 5% 1120] 56.9| 1121] 86.2] 1120 86.0 | 1121] 1428] 1120 | 142.9 | 1119 |172.5| 9.2 
F 718 59.2| 717 59.7] 718 88.7 717 866] 718 145.9] 717 |146.3| 17 1735 7.6 
45-49 M 937 56.5 537 36.9 937] 85.6 937| 85.5 | 937 1421 937 1424 935) 171.2| 8.6 
F 640 58.1 640 38.6 640 85.6) 640| 86.0 | 640 143.7] 640] 144.6] 640] 171.8| 10.0 
50-54 M 852 | 56.6 | 352 56.0 852 84.2 852| 84.3 852 1403 852 140.3 849 168.4 11.5 
F 577 873 | 576|57.7| 577 85.9) 576| 85.5 | 577 | 1432 576 143.5761715 9.7 
55-59 M 633 | 56.2 | 629 55.9| 633 82.4) 629| 83.5 | 633] 1386 629] 1394| 629 |1659 | 13.4 
F 374 57.0 375|57.4| 374843 375] 83.9 | 374| 1413] 375 141.3| 374 168.0| 10.3 
60-64 M 488 54.6 488 54.8 488] 80.4 488| 80.9 | 488 135,0 488 135.4] 487] 161.5 16.9 
F 265 56.3 265 56.4 265| 81.7 265| 82.5 265] 138.0] 265 138.9 265 103.7 14.4 
65-698 | M 883 | 528| 8383 53.1 883 75.3 883| 77.6 | 883 128.1 883] 130.7) 401 158.0 16.5 
F 350 58.1 350|57.9| 350 78.1 350 81.2 | 350 136.2) 350 139.1| 205 | 160.6 | 12.5 
70-74 M 266 | 152.8 | 16.3 
: 4 ETSI ats 
1 3e gat 
“i E 0861 | 85.2 | 10870 | 140.9 | 10861 | 140.9 10849 | 170.3 124 
.6 | 10861 | 55.7 | 10870 | 85.3 | 1 i : g 170. 5 
f E 10419 8700 6416 38.1 6415 86.6 6416 86.2 | 6415 1444 6416 | 144.3 | 6409 | 172.9 10.3 
All Both 17285 | 36.4 17277 56.6 | 17285 | 85.8 17277 85.6 | 17285 | 142.2 | 17277 | 142.2 | 17249 | 171.6 11.7 


Values given are for ages 65 and over for all measures except total visual field. 
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occurring about age 35—40, followed there- 
after by progressive decreases with increasing 
age, with the exception of an increase for fe- 
males age 65 and over. 

(c) For both sexes, right-temporal and 
right-eye visual fields are smaller than their 
left counterparts for the younger age groups, 
while the reverse is true for the older age 
groups. This effect is not exhibited for the 
nasal fields. 


2. Field versus Sex: 

(a) With the exception of the (relatively 
small) over-80 age group, females consistently 
demonstrate larger nasal and temporal visual 
fields (and, consequently, larger eye fields 
and total fields) than do men. The greatest 
difference is evident in the nasal fields. 

(5) For both nasal and eye fields, females 
achieve their maximum values between the 
ages of 35 and 40, while for males this maxi- 
mum occurs between ages 30 and 35. 

An additional finding is that the product- 
moment correlation between total visual field 
and age is slightly but significantly less for 
females (—.420) than it is for males (—.462). 


Discussion * 
Field versus Age 


The results of the present study, showing 
a progressive decline in visual field with in- 
creasing age, are consistent with most previ- 
ous research studies of a similar nature. How- 
ever, the interpretations of findings of this 
type have not been consistent. As Weale 
(1963) points out, one reason for this lack of 
consistency is the fact that senile degeneration 
of the visual mechanism takes several forms, 
some of which may intrude upon visual-field 
measurements and consequently lead to mis- 
taken conclusions as to causal relationships. 
For example, the assumption usually made 
states that visual field constricts with age due 
to peripheral degeneration of the retina, How- 
ever, it is well known that sensitivity to light 
diminishes with increasing age, possibly due 
to the gradual yellowing of the crystalline 
lens and/or because of reduction in pupil 


„The assistance of Frank A. Brazelton (Optome- 
try) and Glenn O. Dayton (Ophthalmology) in in- 
terpretation of the study findings is gratefully ac- 
knowledged. 


diameter (senile miosis), and that visual 
sensitivity decreases as the peripheral retina 
is approached. As a consequence, visual-field 
measurements using perimeters whose arcs 
are evenly illuminated at a fixed intensity, 
Weale continues, may show decreasing field 
with increasing age because of decreased light 
sensitivity and not necessarily because of a 
true peripheral degeneration of the retina. 
Weale concludes that if stimulus intensity 
were increased to compensate for these effects, 
restrictions in visual field would be shown to 
be caused primarily by physical changes in 
the ocular system, rather than by retinal 
decay. 

Regardless of the validity of Weale’s view- 
point, from a practical standpoint the prob- 
lem he raises is not particularly germane to 
the present study. The research herein de- 
scribed is concerned with investigating the 
functional limits of visual field by means of a 
measurement technique that is of sufficient 
accuracy and ease of administration to permit 
its use as a high-volume testing device (e.g., 
for driver-license applicants). Thus, the em- 
phasis is on rapid visual screening rather than 
on detailed clinical evaluation. Practically 
speaking, if measurement conditions are iden- 
tical for all Ss, the fact that visual field 
constricts with increasing age is of importance 
regardless of the physiological reasons under- 
lying this change. In the performance of a 
task such as driving, in which visual-field re- 
strictions do appear to act as a handicap 
(Burg, 1967b), establishing the fact of these 
restrictions is a proper and important function 
of the licensing agency, while etiological diag- 
nosis and possible remedial measures (if any) 
are the responsibility of the clinician. 

One additional point needs clarification. 
The visual-field values derived in the present 
study are of importance primarily as indica- 
tions of relative size as a function of age and 
sex. It is not intended that the obtained mean 
scores be considered completely accurate in 
any absolute sense, for at least two reasons: 


1, The screening test used provides scores 
that may underestimate the true values, be- 
cause the targets are spaced at 5° intervals. 
From a statistical standpoint, a closer ap- 
proximation of the true field values could be 
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obtained by adding 2.5“ to each mean nasal- 
or temporal · field score, and 5.0° to each mean 
eye · ſield or total-field score. 

2. The target illumination used was some- 
what lower than is commonly found in peri- 
metric investigations and, therefore, may 
possibly have slightly penalized the older 5 
because of the light-sensitivity decline men- 
tioned earlier. If such a penalty was in fact 
imposed, however, in all likelihood its magni- 
tude was minor, because of the substantial 
size and reflectance of the white perimeter 
targets used. 


In contrast to the temporal fields, the nasal 
fields initially increase in size before com- 
mencing their progressive decline with age. 
There is no obvious explanation for this find- 
ing; conceivably, it may be due to a physiog- 
nomic change in eye-nose relationships oc- 
curring up to ages 35-40. If such is the case, 
this change reverses direction in the older age 
groups, where gradual recession of the eye 
(relative enophthalmos) causes the nose in- 
creasingly to intrude upon the nasal field 
(Weale, 1963). Both nasal and temporal 
fields show marked declines after ages 35—40; 
if anything, however, the temporal fields ex- 
hibit a greater decline (both absolute and 
relative) than do the nasal fields. 

Similarly, at present no readily acceptable 
explanation can be offered for the reversal in 
relative size that seems to occur, with age, be- 
tween the right temporal and monocular fields, 
and the corresponding fields for the left eye. 
As yet unpublished monocular visual acuity 
data for these Ss do not exhibit correspond- 
ing changes with age. As is possible with any 
research study, this finding (as well as other 
results) may be the consequence of an arti- 
fact in the data; however, in view of the 
large sample size and the extreme care taken 
to maintain standardized experimental condi- 
tions, the likelihood of experimental artifact 
providing a reasonable explanation for this 
outcome appears quite small. 


Field versus Sex 


No obvious explanations can be offered 
either for the slight but significant superior- 


ity in visual-field size demonstrated by fe- 
males or for the fact that nasal and (conse- 
quently) eye fields peak at a later age for 
females than for males. The fact that visual 
field correlates with age slightly less for fe- 
males than for males can be explained, how- 
ever, by the smaller sample size for females, 
which results in a narrower range of scores 
(see the standard deviations presented in Ta- 
ble 1) and, consequently, a statistical limita- 
tion on the size of the correlation obtainable. 

In summary, it may be said that several of 
the study findings pose interesting questions 
of interpretation. Additional research obvi- 
ously is necessary if reasonable answers to 
these questions are to be found. For example, 
it would be desirable to obtain field measure- 
ments on a large number of Ss with higher 
levels of target illumination, physiognomic 
evaluation, pupil-size measurements, and so 
on. Data collected in the present study are 
currently being subjected to more detailed 
analysis to determine whether, despite the 
precautions taken, those Ss who were tested 
wearing glasses (some 32% of the total) 
were in fact experiencing restrictions in their 
field while being measured that could in some 
way account for some of the findings. Other 
analyses in the planning stages include the 
effects of smoking and fatigue (lack of sleep) 
upon visual performance; however the likeli- 
hood is small that either of these analyses 
will provide satisfactory explanations for any 
of the findings described above. 
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would be more expensive, since it 
would require an extra binary unit of informa- 
tion to distinguish between the upper and 
a printout of combined 


* The authors are grateful to M. A. G. Howgate 
of the British Government Communications Head- 
quarters for supplying the Siemens and dyeline 
Siemens teleprinted material used in the experiment. 
P. R. Freeman kindly supplied the Greco-Latin- 
square design, and P. M. E. Altham gave advice 
on how the resulting data were to be analyzed, 
Financial support from the British Medical Research 
Council is also gratefully acknowledged. 
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belts were given short pamages to study for limited periods of time. 


+ appeared to have about equal potential power statistically, but 


size is for text, and the larger size is 
reserved exclusively for headings, use of the 
typewriter shift key is virtually eliminated. 
This makes typing easier and hence less ex- 
pensive. In comparing passages of prose typed 
in the two styles, Fox used as his criterion 
the speed of reading without measuring com- 
prehension. He found that text typed in small 
capitals was read 18% more slowly than the 
standard elite text, but only when read first. 
On changing to the other style the readers 
did not change their reading speed appreci- 
ably, presumably because they knew that 
their level of comprehension was not being 
measured. Similar transfer effects have been 
reported in other situations (Poulton & Free- 
man, 1966). Clearly, more work is required 
in order to assess the size of the likely gain 
to the reader, if any, of converting teleprinters 
from all capital letters to combined upper- and 
lower-case letters. 

The classical experiment of Tinker and 
Paterson (1928) suggests that text typed in 
combined upper- and lower-case letters ought 
to be easier for the reader to comprehend 
than text typed all in capitals. These authors 
used the Chapman-Cook speed of reading 
test printed in 10-point “old style” type, 
Presumably without leading. This method 
also measures speed of reading, but demands 
a certain level of comprehension since in- 
congruous words in the passage have to be 


Tesereorres Ovrrer * 


M 

7 LEP 

e 
tr 1211 H J 
BITH iil i 
ie tere 


f 
i 
f 
zE 
: 
i 
i 


investigating. 


by 
. But its power is reduced by 
large sizes of the individual differences if the 
samples of individuals are at all representa- 
tive (Poulton, 196 8a). In contrast, a Greco- 
Latin-square design produces unmeasurable 
transfer effects, which may reduce the sizes 
of the differences between conditions (Poul- 


research area (Poulton, 1965b). 

A fourth aim was to see whether the style 
of the question sheets affected the scores for 
comprehension. In experiments comparing all 
capitals with mixed upper- and lower-case 
newspaper headlines, it was found to make no 
difference to the experimental comparisons 
whether the question sheets were typed in 
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i Carbon or dyelise copies of test typed 
bs the top wee styles were compared. (The betters 


reported) 
pieces of newspaper with all the headlines in 
capitals were found to favor reliably (p <.01) 
test newspaper sheets with all the headlines in 
capitals. This suggests that although there 
may be transfer effects between the styles in 


ence. If so, it could save a lot of effort, espe- 
cially with Greco-Latin-square designs, since 
0 


compared. 
The separate-groups experiment compared 
the existing Siemens teleprinter typeface, 


upper- and lower-case letters, the pica type- 


typewriter were used (see Table 1). 
IBM 72 pica capitals were included as a pos- 
sible straight alternative to the Siemens 
capitals. Figure 1 shows that they are serifed, 
as compared with the Siemens capitals which 
do not have serifs, but they are not quite so 
tall. Here the stencils for the question sheets 
were typed like the text in IBM 72 pica 
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tale, to see whether this helped the reader. TABLE 2 
Pye group was alko run which read the Resear oF GarcoLaroeSquaer E APESIMENT 
passages in Siemens capitals, and had dyelined — — ‘ : 
question sheets abo in Siemens capitals. For . — 
the remaining groups the stencils of the ques- myn yee F T 
tion sheets were typed in the standard Olivetti ie J — 
Lexikon 80 elite combined upper - and lower- Sinith-Corona elite 66 combined 2 
and lower X 
— — ARSE ner typing Sualth-Corenn pice bal lower case 2 
Senith-C elite 66 all lower 005 
The Greco-Latin-sqware experiment com- n nee 9) 
pared the Siemens teleprinter typeface with Smith-Corona elite 66 all upper case 587 
a dyeline copy of the teleprinter output, and —Dyeline Siemens all upper case S&S 


with capitals from an elite 66 Smith-Corona — ee ai 
electric typewriter (see Figure 2). Two sets o b lz fie were reliably worse tham 


min Glai per mike can ton Sacer taser tm,“ a 
writers. Combined upper and case top 0 ng paper. 
typewriter There were 10 open-ended questions on 
the Smith-Corona elite 66 electric provide Passe to test for comprehension. Noe h 
questions ty i pper- and lower-cai 
a base line. The stencils for all the question Giver Lethon 29 dite lden on 1 r oad 
sheets were Olivetti e 
tal conditions indicated by Footnote b of 
Lexikon 80 elite combined upper and lower Gas, 6: wns ce 
9 responded to the lettering of the test passages. Th 
question sheets for the IBM 72 Pica all- upper - 
Mrrnop passages were stenciled copies. The question sh 


j for the Siemens all-upper-case passages were dy 
rn Ul. The quot were sad ery owed 
. — 2 typed in the tert. There were also passages of about 2,000 oF 
* were typed typed in the styles of Table 2. These were used fn 
Myles ‘shown in Tables 1 and 2 and Figure 1, The the Greco-Latin-square experiment to familiarize the 
r readers with the styles before they were given the 
po ay egy mg — * 8 test passages. The Ss were tested in groups of abo 
"a dozen. 
Fc 
which have carbon interleaved. The first was allocated to one of the styles listed in Table 1. 
enden paper the ' All the passages were typed in all the styles an 
copy C were read in the same order by all Ss. After t 
7, purpose of the experiment had been explained 
A them, Ss were handed two passages each of b 


450 words, comparable to those upon which 

were to be tested, and told to look them over 
TABLE 1 3 min. This was followed by a practice pas 
which had to be studied for only 75 sec. When 
was up, each S had to mark with a pencil the p 
in the passage up to which he had read, and 
to answer 10 practice questions on the content 
the passage. Unlimited time was available 
answering, but Ss seldom took longer than 4 
Finally a test passage was read under cond 
identical to the practice Passage, and 10 test questi 
on it were answered. The whole experiment x 
about 25 min. The 18 Ss who answered correctly 1 
or less of the 10 test questions were discarded. 
Square experiment. The 96 Ss 
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y typed in each of the six styles listed 

The question sheets had the same style of type as the test Table 2. The six passages, each of which was ty 
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seen that the lettering of the question sheets 


made little difference (p > 05). If anything, 
the Siemens text with the dyeline Siemens 
question sheets was slightly 


less well than the Siemens text with the 
stenciled upper- and lower-case question 
sheets, 48.6% compared to 49.3%. 


The table shows that the pica all-upper-case 
text was comprehended if anything slightly 
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remaining styles (p < 05). 
shows that the difference in average compre- 
hension between the combined upper- and 
lower-case text and the all-upper-case texts 
ranged from 3.7 to 4.4%, which become 5.9 
to 7.0% when expressed as percentages of 
the average comprehension of the combined 
upper- and lower-case text. The all-lower-case 
text again lay intermediately. 


F 


hence serifed, or Siemens or dyeline Siemens 
without serifs. The all-lower-case text fell 
between, whether typed in pica or elite. There 
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was little difference between text typed in 
the various styles of all-upper-case letters. 
The pica and elite all-lower-case texts were 
also about equal. The differences between the 
combined upper- and lower-case texts and 
the all-upper-case texts are in line with the 
results of Tinker and Paterson (1928) re- 
ferred to in the introduction for passages 
printed in 10-point “old style.” 

These results for comprehension suggest 
that if teleprinters are to have new typefaces, 
not a great deal is to be gained by changing 
from the existing all upper-case Siemens 
letters to the pica or elite all-lower-case letters 
used in this experiment, as long as the ma- 
terial has to be read and understood. The 
greatest gain from the point of view of the 
reader, who has got to understand and take in 
the information presented, would be a change 
to combined upper- and lower-case letters. 
This unfortunately demands a rather more 
elaborate form of coding for automatic trans- 
mission, and hence some additional expense. 

Comparing the results of the two different 
experimental designs, the means for compre- 
hension are rather higher in Table 2 because 
90 sec. were allowed for reading each passage 
instead of 75 sec., and because Ss had be- 
come more practiced at reading and answer- 
ing the questions by the time they had done 
so six times over, as required by the Greco- 
Latin-square design. The differences between 
the means of the combined upper and lower 
case and the all upper case in Table 2, 3.7- 
4.4, are rather smaller than the corresponding 
differences of 5.8-8.2 in Table 1. This is un- 
likely to have been due to the use of pica 
combined upper and lower case in Table i 
and of elite in Table 2. For the pica and elite 
all lower case gave almost identical results in 
Table 2. And the small difference between 
pica all upper case and Siemens in Table 1 
in favor of Siemens, corresponds to the small 
difference between elite all upper case and 
Siemens in Table 2, again in favor of Siemens. 
Thus the pica and elite typefaces appear to be 
about equally effective. The larger differences 
between conditions in the separate-groups 
experiment of Table 1, compared with the 
Greco-Latin-square experiment of Table 2, 
could have resulted from transfer effects be- 
tween conditions in the Greco-Latin-square 


experiment reducing the average sizes of the 
differences (Poulton & Freeman, 1966). 

However, the potential power of the two 
alternative experimental designs to discrimi- 
nate statistically between combined upper and 
lower case and all upper case is probably of 
about the same order. The Greco-Latin-square 
experiment did not make full use of the po- 
tential power of a factorial design. Its original 
purpose was to make comparisons between 
three upper-case alternatives, and between two 
all-lower-case alternatives. A single com- 
bined upper- and lower-case condition was 
included simply as a base line. If there had 
been two combined upper- and lower-case 
conditions, and only two all-upper-case con- 
ditions, it would have been possible to have 
tested directly between three pairs of condi- 
tions. A level of confidence at around p < .02 
might then have been achieved, as in the 
separate-groups experiment. It took 96 readers 
for the Greco-Latin square, compared with 
five groups each of about half the size (a 
total of 271 readers) in the separate-groups 
experiment. Increasing the numbers of readers 
much further becomes a relatively inefficient 
method of increasing statistical power. It is 
probably more effective to stratify readers ac- 
cording to reading ability, and to use sub- 
groups of equal size drawn from each of a 
number of ability ranges. This is the tech- 
nique used by Zachrisson (1965, Experiment 
4) on children starting to read. It would seem 
to offer the best hope for increasing statistical 
power without sacrificing the representative- 
ness of the readers. 

The relatively small and nonsignificant dif- 
ferences in rates of reading in Table 1 are 
contrary to the results of Fox (1963) quoted 
in the introduction. This is probably because 
our readers knew that they were to be tested 
for comprehension, while Fox’s readers knew 
that they were not to be tested. 


REFERENCES 


Fox, J. G. A comparison of Gothic Elite and 
Standard Elite typefaces. Ergonomics, 1963, 6; 
193-198. 

Paterson, D. G., & Tinker, M. A. Studies of typo- 
graphical factors influencing speed of reading: 
X Style of type face. Journal of Applied Psy- 
chology, 1932, 16, 605-613. 


Tetersintesr Ovtrur 21 


Letter differentiation and rate of 

in reading. Journal of Applied Pry- 
chology, 1965, 49, 358-362. (a) 

Poutrow, E. C. On increasing the sensitivity of 
measures of performance. Ergonomics, 1965, 8, 69- 
76. (b) 

Poutrom, E. C. Searching for newspaper headlines 
printed in capitals or lower-case letters. Journal 
oj Applied Psychology, 1967, $1, 417-425 

Pouttow, E. C, & Fru, P. R. Unwanted 
asymmetrical transfer effects with balanced ex- 


Povtrom, E. C. 
comprebension 


orig designs. Prychelogial Bulletin, 1946, 

Tovers, M. A. Legibility of print. Ames: lows State 
University Press, 1965. 

Tua, M. A, & ame, D. G. Influence of type 
form om speed of reading. Journal of Applied 
Prychology, 1928, 12, 359-368. 

Zacuumson, B. ee See Uppsala, 
Sweden: Almqvist & Wiksell, 1965 


(Received January 25, 1967) 


Bureau of Fal & Psyl, rvs «ren 
GSA cE, muted 


Se a bap re 
rape 


Journal of Applied Psychology 
1968, Vol. 52, No. 1, 22-28 


ORGANIZATIONAL CORRELATES OF EXECUTIVE ROLES * 
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Relationships between each of 9 executive role dimensions and various features 
of the organizational setting were studied. The sample consisted of 194 
middle-management personnel of the U. S. Army. The roles, as described on 
questionnaires by the executives themselves, were found to vary with differences 
in the work setting. Organizational features most prominently associated with 
variations in role included the mission and level of the organization, the job 
family of the executive, and his span of control. The role dimensions which 
were most prone to vary with the situation were staffing, controlling, and 
time spent with others. Executives whose roles emphasized administrative 
more than technical features received higher average performance ratings 


from their superiors. 


It is a well-known fact of social psychology 
that the behavior of members of a social 
system is shaped by characteristics of the 
system as well as of the individuals. The con- 
cept frequently invoked to link the system 
and the individual is that of role, that is, a 
set of activities and responsibilities associated 
with a particular position or job. 

The notion that a given job role may vary 
in different organizations has important im- 
plications for personnel management. Among 
them is the prospect that the personnel re- 
quirements for a job are likely to depend 
on contextual considerations. Failure to pay 
due attention to situational factors could 
therefore result in defective personnel pro- 
cedures, such as invalid selection techniques, 
ineffective training programs, or inappropriate 
supervision. 

While the applicability of these concepts 
to executive personnel management is gen- 
eral recognized, they have received relatively 
little systematic or detailed attention. It would 
seem important not only to know in what 
ways executive roles may differ among organ- 
izational settings, but also whether there are 
stable relationships between given role dimen- 
sions and specific organizational variables such 


1 This research was conducted by contract to the 
Department of the Army under sponsorship of the 
United States Army Personnel Research Office. 

The opinions expressed in this paper are those 
of the authors and do not necessarily reflect official 
Department of the Army policy. 
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as size or span of control. The present paper 
reports research on this topic, undertaken 
as an early stage of a program of improved 
personnel management of civilian executives 
in the Department of the Army. 

More specifically, the study sought to 
determine whether differences in the ways in 
which executives perceive their roles are as- 
sociated with differences in variables char- 
acterizing their organizational settings. As 
subsidiary objectives, the study also sought 
evidence of interaction between organizational 
and personal variables in their relations with 
these role dimensions, and sought data bearing 
on the relationships between role behaviors 
and executive effectiveness. 


METHOD 
Subjects 


The data were collected on 194 male civilian ex- 
ecutives of the Department of the Army. With the 
exception of seven who were in General Service 
(GS) Grades 16 and 17, they were classified in 
GS Grades 13 through 15, roughly equivalent to 
“middle management.” To assure a sample broadly 
representative of such positions, Ss were selected 
from six Army organizations with different missions, 
levels, sizes, and geographical locations. The Ss’ 
mean age was 46.7 yr. (o = 8.1), and had a mean 
supervisory experience of 15.2 yr. (ø = 8.6). The jobs 
covered a variety of titles in such fields as finance, 
research and development, technical specialties, and 
personnel management; relatively few could be con- 
sidered typically “line management,“ since such 
Positions are usually occupied by military officers, 
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Measures of Job Behavior 


This paper is limited to the study of job be- 
haviors and responsibilities as perceived and reported 
by Ss themselves, using the following questionnaires: 


(a) Executive Position Description: an adaptation of 
85 items from the Executive Position Description: 
Form IV developed by Hemphill (1960). Respond- 
ents indicated on a 4-point scale the extent to which 
an activity or responsibility described by an item 
was a part of their position, The following items are 
typical: 


“Schedule work so that it flows evenly and 
steadily.” 
“Reduction of costs.” 


(b) Work Analysis Form: an adaptation of one 
by the same name described by Stogdill and Shartle 
(1955). The respondent estimated the percentage of 
time which he spends in contact with others and 
in individual effort. He then reported the detailed 
breakdown of the time spent in these two major 
categories of duties, and also the time spent on 
duties included in a list of major responsibilities. 
The following items are typical of each of the 
categories, respectively: 


“Attending committee meetings and conferences.” 
“Preparing and writing reports, orders, etc.” 
“Professional consultation—giving technical advice.” 


(c) Performance Style Questionnaire: 40 items 
listing the work patterns, habits, and interpersonal 
relationships a person may emphasize in his job 
following the approach described by Barrett (1961). 
The respondent rated the items in a forced-normal 
distribution according to the extent to which they 
characterize his job behavior. Typical items are: 


“Stays out of arguments.” 
“Gives attention to matters immediately as they 
arise.” 


Measures of Organizational Characteristics 


Information about the organization as a whole, 
such as the number of personnel or type of mission, 
was obtained by the researchers from records and 
interviews with top-level executives. Descriptions 
of certain organizational variables immediate to 
each S’s job, such as his GS level, were obtained 
from him by use of a background information sheet. 
A third type of contextual information concerned 
the supervisory climate in which each S works. 
Some of this was obtained by questionnaire from 
the supervisor himself, including such items as his 
age, education, amount of supervisory experience, 
etc. To provide data on the supervisor’s behavior, 
each S filled out the Leader Behavior Description 
Questionnaire (Stogdill & Coons, 1957) describing 
his perceptions of the extent to which his supervisor 
manifests consideration and initiation of structure. 


Other Measures 


One of the subsidiary objectives of the study was 
to see whether personal characteristics interact with 
organizational characteristics in their relations with 
job behavior. For this purpose, data on certain 
personal variables, such as age and education, were 
obtained on the background information sheet filled 
out by each F. Another subsidiary objective con- 
cerned possible relationships between executives’ job 
behavior and ratings of their effectiveness. For the 
latter purpose, a simple three-scale rating form was 
filled out by the immediate supervisor of each S. 
One of the scales concerned a comparison of the job 
performance of S with others at his level, a second 
entailed a comparison of his performance with 
standards, and a third asked for the supervisor's 
willingness to hire the man if he had his choice. 
This instrument was used in another study of ex- 
ecutives by one of the authors, in which the sum of 
the three scales was found to have an interrater 
reliability of .77. 


Data Analysis 


In order to condense the descriptions of job be- 
haviors and responsibilities into a limited number of 
meaningful role dimensions, a centroid factor analysis 
was performed on the three questionnaires describing 
job activities. Excluded were those items in the 
Executive Position Description where more than 
50% of the respondents reported that the activity 
or responsibility was not relevant to their position. 
This yielded a total of 90 items which entered into 
the intercorrelation matrix. Eight orthogonal fac- 
tors emerged: 


1. Long-Range Planning: planning for future 
activity, including forecasting trends or events, con- 
ducting pilot projects, being concerned with legisla- 
tion which might affect the organization, and estab- 
lishing long-range objectives for the organization. 

2. Staffing: personnel activity characterized by 
involvement in selection, placement, training, and 
advancement of employees. 

3. Technical Consultation: gathering, consolidating, 
interpreting, and providing others with facts and 
information. 

4. Budgeting: preparation and defense of budgets, 
and accounting for expenditures. 

5. Shared versus Individual Responsibility: the 
shared pole being characterized by emphasis on 
keeping others informed of present states of affairs, 
whereas the individual pole emphasizes personal 
responsibility for taking action. 

6. Operational versus Professional Concerns: the 
operational pole being characterized by concern for 
the efficiency of present organizational operations and 
personal professional development, whereas the pro- 
fessional pole is oriented toward advising others on 
technical matters. 

7. Technical versus Administrative Activity: the 
technical pole being characterized by the use of pro- 
fessional tools and techniques, including writing 


TABLE 1 
Rote DIMENSIONS As FUNCTIONS OF ORGANIZATIONAL VARIABLES 


Role dimension 
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with others 


administrative | Controlling | with 


Operational | Technical vs. 
sional 


vs. profes- 


dividual effort 


Role dimension 
Shared vs, in- 


Table 1—Continued 
sultation Budgeting 


Technical con- 


Staffing 
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technical reports, whereas the administrative pole 
involves coordination of activity, communication, 
| and recommendations. 
| 8. Controlling: relating to formulation of limited 
| plans or specifications in regard to resources and 
| finances of the organization, including reducing costs. 
| 


to 
27888288 


SAA SAA 


Only a few of the items from the Performance 
Style Questionnaire were found to have appreciable 
| loadings on any of these factors, so for reasons of 
pa rn effidency it was decided to exclude it from the 
measurement of job behavior. Scores on the eight 
maanqananian factors were computed for each S on the basis of 
aikai kad kai baj specification equations, each including between 5 
and 10 items from the other two questionnaires. 
The percentage of time on the job that the in- 
cumbent reported spending in contact with other 
3 individuals was considered as an additional role 
2288825 variable, because of its intrinsic interest. Thus, a 
total of nine variables constituted executive “job 

behavior” or “roles” as discussed below. 
In passing, it may be of interest to compare the 8 
A factors described above with the 10 which Hemphill 
arononnmams (1960) identified in his study of executives. Long- 
Range Planning is essentially duplicated in both 
studies, and there is some similarity between our 
Controlling factor and his of Business Control. The 


2 other factors are not parallel in the two studies, 

Ans NEE) perhaps because of differences in the jobs, in Ss, in 

S the settings, or some combination of such para- 
meters. 


Analysis of variance was used to determine 

whether each of the nine role dimensions was differ- 

nyanoonne n2 entially associated with organizational character- 
istics. In instances where organizational variables 

yielded continuous distributions, their association 

with the nine role dimensions was determined in- 

stead by product-moment coefficients of correlation. 

The N on which these analyses were based was 

Stanavadae usually somewhat smaller than the total of 194, 
iA because data were not complete for all Ss on all 
variables; no systematic basis was discerned for the 


Organizational variable 


Be loss of cases. 
8 
5 EREEREER RESULTS 
ga 
8 Role Dimensions in Relation to Organizational 
Jes e oS oY BR, Variables 
Table 1 presents, in terms of analysis of 
= D 2 8 8 8 


variance, the comparison of role-dimension 
scores associated with various organizational 
and job settings. Table 2 reports product- 
moment correlation coefficients between role- 
dimension scores and certain continuously 
distributed organizational variables. In both 
ay tables, significance levels are reported on the 
se basis of two-tailed tests. 
Tables 1 and 2 show that there were a 
zs number of significant differences in roles of 
the executives studied, depending on the 


gineering 


Construction & equipment en- 
Technical specialties 


Research & development 
Finance & data processing 


Type of executive job 
Personnel 
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organizational setting in which they operated. Executives in the smaller organizatic 
Organizational features associated with these placed more emphasis on Long-Range Pt 
differences included the organizational mis- ning, possibly because small organizatic 
sion, level, size, stability, and whether urban may receive less attention and direction fro 
or rural in location. The executive's job headquarters and may therefore be mo 
family, GS level, and span of control also autonomous with respect to plans. These ex- 
were influential, The role dimensions most ecutives also spent less time with others, 
frequently varying with organizational char- which may support the common observatic 1 
acteristics were Staffing, Controlling, and Time that larger organizations require more com- 
Spent with Others. These relationships are munication. Not surprisingly, executive 
described in more detail below. in expanding organizations were relative 
As shown in Table 1, the roles of those high in Staffing responsibilities. 
executives who worked in an organization The job family of the executive's position 
having a mission of administration entailed was found to be related to his role: those in 
relatively little emphasis on Long-Range research and development work were less n- 
Planning and relatively strong emphasis on volved in Long-Range Planning, those 
Staffing and on Controlling. Roles in organiza- construction and equipment engineering con 
a major mission in the area of cerned more with Budgeting and less with 

or materials Operational considerations, personnel special- 

to be relatively high on ists more concerned with Long-Range Plan 
and on ning but less with Budgeting, and financi 

Time Spent with Others, and were average people more concerned with Long-Range Plz 
3 and ning and Budgeting. i 

on As shown in Table 2, the more peop e 
— Planning and low Staffing, supervised directly or indirectly by the ex 
C and Time Spent with Others. ecutive, the more role emphasis there was on 
also Long-Range Planning, Staffing, Budge ing, 

to the in- Controlling, Administration, and Time with 
ts. The higher the level, the less Others. 4 
were Staffing and Controlling di- The GS levels covered in this study were 
largely limited to Grades 13, 14, and 15. 

Those executives who worked in the Wash- This restriction of range accounts in part for 
ington headquarters gave less emphasis to the relatively small correlations found in 
Table 2 between GS level and role character- 
ilar those comparing istics. Nevertheless, the higher-level people 
tions, partially because were found to place significantly less stress 
one of the major urban on Controlling and more on Administrative 

settings studied. activity rather than Technical activity or 
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TABLE 2 
COEFFICIENTS or CORRELATION BETWEEN Rote DIMENSIONS AND CERTAIN ORGANIZATIONAL VARIABLES 
n 


Role dimension 


Technical Shared | Operati i 
4 . | Budget- | vs. indi- Operational Technical vs. ri bent 
ing vidual yP protes: adie Controlling! ien 


effort s 


06 10 =.20* 22 182 
=t —.22* 


Note.—N = 145, 
*p <.05 
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acteriatics of Ss’ supervisors anil SP own, role 
perceptions. The coefficients in Table 3 are 
small, sod the ee otio oEn 
not much beyond chance expectancy. There 

is some indication that the supervisors’ formal 
education was appreciably associated with — 
roles of their er hn 
more highly educated 

ordinates whose roles involved less emphasis 
on Long-Range Plamfing and on Controlling, 
and greater emphasis on Professional rather 
than Operational Concerns. This finding may 
be as much a function of the technical nature 
of the jobs held by subordinates of highly 
educated supervisors as it is of the personal 
impact of the supervisor. On comparing the 
columns of Table 3, it appears that Long- 
Range Planning is the role dimension most 
frequently associated with characteristics of 
the supervisor; executives whose roles more 
heavily featured this characteristic were those 
whose supervisors did less Initiation of Struc- 
ture, had less extensive face-to-face contact 
with their subordinates, showed evidence of 


greater mobility in jobs, and had less educa- 
tion. 


“Weraction Effects of Personal and Organiza- 
tional Variables 

In a complete nomology of executive role 
behavior, one would expect that organizational 
influences would interact with personal dis- 
positions. The present study was not designed 
to delve deeply into correlates of 
executive roles, but it did include the col- 
lection of certain data which 
could be examined in this light. The personal 
variables found to be more strongly associated 
with role have to do with educational back- 
ground. For example, Ss with more formal 
education described their jobs as entailing 
less Long-Range Planning, Staffing, Budget- 
ing, and Controlling, but more Professional 
Concerns. However, most of the personal 
variables were found to have only slight rela- 
tionships, if any, with the nine role dimen- 
sions; this statement covers variables like 
age, amount of experience, and 
number of dependents. Perhaps for this reason, 
when two-way analyses of variance were 
performed of role-dimension scores against 
both personal and organizational variables, 
the number of significant interaction effects 
was only about chance, and yielded no mean- 
ingful overall picture. 
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Relation oj Role to Performance Ratings 


One of the nine role dimensions turned 
out to have low but statistically significant 
relationships to the three ratings (median r = 
19), to wit, executives whose roles more 
prominently featured Administrative as con- 
trasted with Technical Activity received 
higher ratings from their superiors. This was 
not due to a tendency to rate higher those 
who occupy higher-level jobs, since the cor- 
relations of ratings and GS level were approxi- 
mately zero. It may signify that administra- 
tive behavior is perceived as a more critical 
requirement of executive jobs of the types 
studied. 


Discussion 


In line with psychological theory, executive 
role was found to be associated with features 
of the organizational and work setting. In this 
sample of positions and organizations, dif- 
ferences in executive roles were especially 
marked between organizations of different 
missions and levels, between jobs designated 
as general administration compared to those 
characterized as scientific or fiscal, and be- 
tween positions showing differences in span of 
control. The role dimensions of Staffing, Con- 
trolling, and Time with Others were par- 
ticularly prone to vary with organizational 
characteristics. 

These results underscore the potential 
fruitfulness of taking organizational settings 
more fully into account when designing ex- 
ecutive personnel management practices, such 
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as techniques of selection, training, or super- 
vision. For example, it would be reasonable 
to expect that critical requirements for ex- 
ecutives having large spans of control, which 
were found here to be characterized by greater 
role emphasis on administrative, planning, and 
personnel matters, would be different from 
those having smaller spans of control. The 
logical next step would be to see whether 
improvements in executive personnel manage- 
ment can indeed be effected when organiza- 
tional variables are included as moderating 
parameters in investigations of the relation- 
ships between management methods and 
criteria of effectiveness. The present study 
indicates the relative potency of a number of 
organizational variables in this regard; by 
specifying how role behavior is associated 
with the more influential variables, it sug- 
gests which aspects of management method 
may be fruitfully examined for their impact. 
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USE OF HIGH SCHOOL DATA TO PREDICT 
WORK PERFORMANCE’ 


MARSHALL H. BRENNER 
Lockheed-California Company 


For a sample of high school graduates employed in an aircraft plant, teachers’ 
work habits and cooperation ratings, absenteeism, and grade-point data were 
obtained from high school transcripts and related to work-performance criteria 


of supervisory ratings, absenteeism, and tardiness records. 


Significant rela- 


tionships were obtained between the high school predictors and the work- 


performance criteria. 


For the great majority of industrial test- 
validation studies, the obtained validities are 
disappointingly low, indicating that the in- 
dustrial researcher must search for additional, 
nontest predictors to be used in conjunction 
with the test information. One of the factors 
that is frequently considered important is 
that of “attitude toward work.” Attitude is 
not frequently utilized, though, because psy- 
chologists have found it extremely difficult to 
obtain useful measures of attitude in an 
employment setting. However, a meaningful 
measure of attitude might be obtained from 
an earlier situation similar in nature to the 
work situation, one in which the § had an 
opportunity to express similar attitudes. This 
study was directed toward the investigation of 
one source of previous evidence of attitude— 
data concerning behavior in the high school. 


PROCEDURE 


The Ss used in this study were 100 employees of 
the Lockheed-California Company who (a) gradu- 
ated from a high school in the Los Angeles City 
School Districts, (b) attended that school for at 
least 2 yr., (c) were employed by the Lockheed- 
California Company within 1 yr. of graduation, and 
(d) were employed by the Lockheed-California Com- 
pany for at least 1 yr. 

The data for the first seven variables (the pre- 
dictors) in Table 1 were obtained from Ss’ high 
school transcripts, while the data for the next five 


1 Extensive assistance was received in connection 
with this study from many members of the Los 
Angeles City School Districts. It was a pleasure to 
work with Muriel Sheldon, Clyde Sorensen, and Rose 
O'Connor of the Secondary Guidance and Counseling 
Section who provided invaluable advice on the de- 
sign of the study and obtained all of the predictor 
data used in it. 
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(the criteria) were gathered from supervisory ratings 
and departmental records, 

The teachers’ work habits and cooperation ratings 
were given on a 3-point scale, and the supervisory 
ratings on a 5-point scale. The “sex” variable was 
scored 0 for females and 1 for males. 


RESULTS 


The means, the standard deviations, and 
the intercorrelation matrix are presented in 
Table 1. 

Reference to Table 1 reveals the following: 


1. The standard deviations of both the 
predictor and the criterion variables are large 
enough to provide adequate discrimination. 

2. The standard deviations of the absen- 
teeism and tardiness data are much larger 
than would be expected if the distributions 
were normal. This indicates that a relatively 
large percentage of Ss had either a very low 
number of absences or a very large number. 
Plotting of the data revealed this to be the 
case. 

3. The teachers’ ratings are very highly 
related to each other and to students’ grade 
point. 

4. The teachers’ ratings, the students’ grade 
point, and school absences are significantly 
related to supervisory ratings. Ten of these 
12 correlations are significant at the 01 
level. 

5. The teachers’ ratings and school ab- 
sences are related to work absences at the 
01 level. 

6. Grade point and rank in class are ex- 
tremely highly correlated, indicating that a 
given grade-point average would be in the 
same relative rank position in each of the 12 
schools furnishing data. 
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TABLE 1 


MEANS, STANDARD DEVIATIONS, AND INTERCORRELATIONS OF VARIABLES 


i 

Variable M eee 
1. School absences | 21.5| 19.6 40 —11 —14 —28 —27 —32 —25 —31 —21 30 2 06 
2. Unexcused | 

school | 

absences | 1.8] 3.3| 40 —04 -06 —18 —16 —25 —11 -31 —09 13 07 17 
3. Three-year | 

grade point 2.2 6 —11 —04 96 78 67 73 37 34 16 10 —58 
4. Rank in class 

(inverted) 42.7 27.4 —14 —06 906 73 63 72 33 34 30 —16 10 —56 
5. Teacher's work 

habits rating 2.1) 4-28 —18 78 73 85 75 34 44 41 —26 —02 —51 
6. Teacher's coop- 

eration rating 2.4 3-27 -16 67 63 85 64 30 45 39 —29 —02 —42 
7. Grade point last d 

two semesters | 2.44 6 —32 —25 73 72 75 64 30 38 26 —16 07 —41 
8. Supervisor's 

ability rating | 3.44 9 —25 —11 36 33 34 30 30 61 72 14 —05 —23 
9. Super visor's 

conduct rating} 3.5} 9 —31 —31 37 34 44 45 38 61 73 —31 —15 —22 
10. Super visor's | 

production 

rating 3.44 9|-—21 —09 34 30 41 39 26 72 73 —23 —14 —29 
11. On- the- job 

absences 11.2 9.2 30 13 —16 — 16 —26 —29 —16 —14 31 —23 37 —03 
12. On- the- job 

tardies 3.6 41) 20 07 10 10 —02 —02 07 —05 —15 —14 37 —22 
13. Sex 06 17 —58 —56 —51 —42 —41 —23 —22 —29 —03 —22 


Note.—A correlation of .20 is necessary for significance at the .05 level. A correlation of .26 is necessary for significance at 
the .01 level. Decimals have been omitted from the correlation part of the table. 


Discussion 


These results indicate the following: 


1. The high school data collected in this 
study have adequate discrimination to serve 
as predictor data. The teachers are making 
meaningful discriminations of work habits 
and cooperation behavior. 

2. The high school data are significantly 
related to performance on the job as evalu- 
ated by the immediate supervisor. Of even 
greater significance, the degree of relation- 
ship between the high school data, especially 
the teachers’ ratings, and the work-perform- 
ance data is greater than that normally ob- 
tained with aptitude tests (e.g., see Ghiselli 


& Brown, 1955, p. 227). In other words, 
whatever it is that determines a supervisor’s 
evaluation of work performance, the high 
school teacher is able to (and does) measure 
it more successfully than aptitude tests. 

3. Absenteeism on the job can be pre- 
dicted to a statistically significant extent by 
absenteeism at school and by the teachers’ 
ratings. 
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VOCATIONAL INTERESTS OF FEMALE 
COMPUTER PROGRAMMERS 


DALLIS K. PERRY axo WILLIAM M. CANNON 


System Development Corporation, Santa Monica, California 


Scores of 293 female computer programmers on the SVIB for men were found 
to be quite similar to those of male programmers, although interests of the 
women were somewhat higher in aesthetic and scientific fields and lower in 
technical and technical-supervision occupations. Female programmers indicate 
interest in all forms of mathematics and lack of interest in people. A key com- 
posed of items with a 12% or larger difference between female programmers 
and other professional women showed a 20% overlapping of professional 
women in general with a cross-validation group of female programmers. 
Programmers who are dissatisfied with their occupation score significantly 
lower on the key than do satisfied programmers. 


In the process of measuring the vocational 
interests of computer programmers and de- 
veloping a Strong Vocational Interest Blank 
(SVIB) scale to represent these interests, a 
substantial number of completed SVIBs for 
men were obtained from female programmers. 
The following points became clear through 
analysis of SVIB scores for the female pro- 
grammers: 


(a) The vocational interests of female pro- 
grammers are highly similar to those of male 
programmers. 

(b) The differences in interests that char- 
acterize men and women in general charac- 
terize male and female programmers as well, 
with the result that the male programmer key 
with male norms does not adequately repre- 
sent the interests of female programmers 
(Perry & Cannon, 1967). 


Because there is considerable overlapping 
of items between the SVIBs for men and for 
women, it appeared feasible to use the re- 
sponses of female programmers on the SVIB 
for men to develop a computer programmer 
key for women that could be used with the 
women’s form of the blank. At the same time 
it seemed desirable to provide normative data 
to facilitate the interpretation of women’s 
scores on the male programmer key in those 
instances in which the men’s SVIB form is 
used with women. The present study, there- 
fore, was intended to (a) develop a computer 
Programmer key for the SVIB for women, 


31 


and (b) develop female norms for the pro- 
grammer key for the SVIB for men. 


METHOD 


Subjects. The 1,378 computer programmers par- 
ticipating in the original study included 186 females. 
This group was augmented by 107 additional females 
from six of the large programming organizations in 
the original study. The same data-collection and re- 
porting procedures were again followed to assure 
anonymity of participants (Perry & Cannon, 1967). 
The original data were gathered in 1964 and the 
additional data in 1966. 

The total group of 293 female programmers was 
reduced for purposes of key construction and valida- 
tion by dropping 31 members who indicated that 
they (a) were dissatisfied with programming (“prob- 
ably” or “definitely” would prefer another field); 
(b) were primarily responsible for computer opera- 
tion, programmer instruction, or computer installa- 
tion management; or (c) had less than 1 yr. of 
experience in programming. With the exception of 
the last restriction, these constraints are the same as 
those applied to the male programmers. Male pro- 
grammers with less than 2-yr. experience were 
found to have vocational interests nearly identical 
with those of more experienced programmers. Con- 
sequently, it appeared justifiable to lower the ex- 
perience requirement from 2 yr. to 1 yr. in the in- 
terest of maintaining sample size. The resulting 
primary sample of 262 female programmers was con- 
sidered adequate for key construction and valida- 
tion, although it was smaller than desired. 

Sample characteristics. The Ss in the primary sam- 
ple ranged in age from 21 to 59, with a median of 
29.9, mean of 31.6, and standard deviation of 7.3. 
Almost 90% were 40 or younger. This group is thus 
younger than all the present SVIB female criterion 
groups except secretary-stenographer (Strong & 
Campbell, 1966). Like male programmers, more than 
90% of the females had attended college, but more 
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were college graduates and ſewer held advanced de- 
grees. A much larger proportion of the females (60%) 
reported majors in mathematics, with no large 
grouping in any other field. All but 5% of the 
sample reported some kind of formal training in 
programming, the average amount being 160 hr. As 
might be expected, the proportion of females with 
machine-operations experience (20%) was much 
smaller than that for men (37%), and the average 
amount of operations experience was much less. 
Mean programming experience (5.3 yr.) of the 
female sample, however, was nearly identical to that 
of the male sample, despite the inclusion of a few 
I- yr. programmers in the female group. 

Employment characteristics of the female pro- 
grammers were substantially different from those of 
the male programmers. Greater proportions of the 
females were in government and research and de- 
velopment organizations, and smaller proportions 
were in business and industrial firms. More of the 
females were on large programming stafis, and fewer 
were doing business programming. Fewer females 
were involved in requirements analysis, program 
system design and installation, and supervision. 


Median monthly salaries of the women ($757) aver- 
aged about $75 less than the men, despite the fact 
that experience of the two groups was about the 
same and about one-third of the women were re- 
sponding 2 yr. later than the men. Examination of 
similar data for the portion of the female sample 
obtained in 1964 indicated that the above differ- 
ences between male and female programmers were 
not due to differences in sampling or dates of col- 
lection. Apparently there are differences in the kinds 
of programming employment entered by, or available 
to, female programmers. 

Procedure. For construction of interest keys the 
primary sample was divided randomly into two parts: 
a criterion group of 200 cases, and a validation 
group of 62 cases. The reference group represented 
an equal sampling from each of the 26 female SVIB 
occupational criterion groups.! 

Items available for use in a new key for women 
included only those that: 


1 Reference-group response percentages were ob- 
tained from the Center for Interest Measurement Re- 
search, University of Minnesota. 


TABLE 1 
Means AND STANDARD DEVIATIONS or FEMALE PROGRAMMERS on SVIB Keys ror MEN 
Key M SD Key M SD 
1 Optometrist 37.53 11.51 | 31 Soc. Science Teacher 27.89 10.58 
2 Physical Therapist 31,00 11.36 32 Sch. Superintendent 28.62 | 11.60 
3 Dentist 32.71 | 10.59 33 Minister 26.23 | 10.99 
4 Osteopath 28.80 | 09.65 | 34 Librarian 40.50 | 10.38 
5 Veterinarian 18.59 09.71 35 Artist 31.24 10.31 
6 Physician 37.85 11.64 |36 Art Teacher 28.74 11.06 
7 Psychiatrist 36.74 10.30 |37 Musician Performer 44.38 09.24 
8 Psychologist 37.10 10.46 |38 Music Teacher 34.90 10.48 
9 Biologist 38.84 11.19 39 C.P.A. Owner 37.61 09.10 
10 Architect 36.81 11.12 |40 Senior C.P.A. 40.32 | 08.84 
11 Mathematician 36.00 10.73 |41 Accountant 37.80 10.06 
12 Physicist 29,51 13.31 | 42 Office Worker 35.64 09.98 
13 Chemist 41.22 11.56 | 43 Credit Manager 32.12 11.00 
14 Engineer 39.43 10.74 44 Chamber of Commerce Executive 31.10 10.37 
15 Production Manager 34,27 07.46 | 45 Bus, Educ. Teacher 30.91 11.22 
16 Army Officer 26.89 | 12.15 | 46 Purchasing Agent 26.11 | 09.36 
17 Airplane Pilot 28.96 10.60 | 47 Banker 22.71 09.09 
18 Carpenter 20.99 11,86 | 48 Pharmacist 29.95 08.45 
19 Forest Service Man 22.11 12.32 | 49 Mortician 26.63 08.53 
20 Farmer 29.60 09.49 | 50 Sales Manager 23.93 09.05 
21 Indus. Arts Teacher 25,54 10.03 | 51 Real Estate Salesman 28.59 07.08 
22 Math-Science Teacher | 40.17 08.86 | 52 Life Ins. Salesman 22.30 | 09.20 
23 Printer 30.49 08.95 | 53 Advertising Man 31.61 08.88 
24 Policeman 24.95 07.75 54 Lawyer 33.62 08.48 
25 YMCA Phys. Director 21.81 11.64 | 55 Author-Journalist 33.24 07.91 
26 Personnel Director 31.64 11.78 56 Pres.-Mfg. Concern 20.02 09.20 
27 Public Administrator 36.32 09.58 57 Specialization Level 44.33 09.66 
28 Rehabilitation Couns. 31,30 10.19 58 Occupational Level 59.84 06.52 
29 YMCA Secretary 20.85 11.93 59 Masculinity-Femininity 37.99 08.85 
30 Social Worker 29.40 11.98 60 Computer Programmer 20.34 10.17 
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(a) had women-in-general response counts avail- 
able (ie, were in the 1946 edition of the SVIB for 
women); and 

(b) appeared in the revised SVIBs for both men 
and women in identical or equivalent forms. Judg- 
ments of equivalence were rather conservative, an 
only a few items were not identical. These included 
6 that involved direct changes in gender (e.g actor- 


Fic. 1. SVIB profile for male and female compu 
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ter programmers. 


actress) and 18 that differed slightly in wording 
(eig., member of supreme court supreme court 
justice; president of society president of society or 
club; author of “best seller”—author of best-selling 
novel). 
There were 197 items that met these requirements 
and were therefore available for key construction. 
Several interest keys of varying numbers of items 
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TABLE 2 
MEANS AND STaxpaRD Deviations or Four Groups 
ON THE Pemate Computer Proorammer Key 


Group N M SD 
Criterion 200 20.26 | 11.52 
Validation | 62 1821 9.38 
Women in General —11,52 | 13.73 
Dissatisfied 24 13.66 


9.04 | 
Í Í | 
* Estimated as 1.24 times the combined criterion-validation 
group standard deviation, based on the average ratio of women- 
244 — to criterion group standard deviations for existing 
SVIB keys. 


were constructed and evaluated. Differences in 
validity were slight, and a key composed of items 
with a 129% or larger difference between criterion and 
reference groups was selected as having the best 
combination of both validity and sufficient length 
to provide adequate reliability. 

The answer sheets were scored on 60 male SVIB 
keys, including the male programmer key, and 
normative data for female programmers on both the 
male and female programmer keys were computed. 


RESULTS 


Interests of female programmers. Means 
and standard deviations of female program- 
mers’ scores on 60 SVIB keys for men are 
listed in Table 1 and shown in profile form in 
Figure 1. For comparison, the profile for male 
Programmers also is shown in Figure 1. The 
similiarity of the two profiles, despite the fact 
that all the scores were standardized on men, 
is apparent. The women have higher scores in 
the aesthetic fields, as would be expected, 
somewhat higher scores in scientific fields, 
and lower scores in the technical supervision 
and technical occupations. The marked simi- 
larity of the profiles, even in occupational 
areas that show differences in level of in- 
terest, emphasizes the similarity of interests 
of male and female programmers. 

Examination of the SVIB items that differ- 
entiate female programmers from women in 
general reveals two major clusters paralleling 
those for male programmers. These clusters 
indicate a strong interest in all forms of 
mathematics and a lack of interest in people 
—especially in activities involving responsi- 
bility for helping people. Female program- 
mers are also more interested than other 
women in physical science and technical ac- 
tivities and less so in natural and social 


science and literature. They prefer indepen- 
dent, nonroutine work. 

Female programmer key. The female pro- 
grammer key contains 156 weighted responses 
to 88 items. Means and standard deviations 
of criterion, validation, and reference groups 
are given in Table 2. The percentage of over- 
lapping of the criterion and reference groups 
is 21, and of the validation and reference 
groups it is 20. In these validity character- 
istics the key compares favorably with exist- 
ing SVIB keys for women. Two-week retest 
reliability of the key for 112 women is 90. 

Of the 31 female programmers not included 
in the primary sample, 24 were excluded 
cause they indicated that they “probably” 
“definitely” would prefer a different field of 
work. The mean and standard deviation of 
programmer key scores for this group of 
dissatisfied programmers are also shown in 
Table 2. The average score of the dissatisfied 
group is substantially and significantly lower 
than are those of the other programmer 
groups, providing additional evidence of the 
effectiveness of the key. 

Relationships to male programmer key. 
Of the 102 responses in the male programmer 
key and 156 responses in the female pro- 
grammer key, only 44 were common to the 
two keys. However, of the 58 responses in 
the male and not in the female key, only 
15 were available for inclusion in the female 
key. The large number of responses in the 


Reliability was computed by the Center for In- 
terest Measurement Research and provided through 
the courtesy of David P. Campbell. 


TABLE 3 


Normative DATA ror FEMALE COMPUTER 
PROGRAMMERS ON MALE AND FEMALE 
COMPUTER PROGRAMMER KEYS 


Raw score Percentage 

Heaton 
Female on female key 

pea standard 
score Female Male 
key key Pro- 
grammers| W-. G. 

A 45 to 14to 81 16 to 56 70.2 34 
B+ | 40 to 44 9to 13 12to 15 16.4 3.8 
B 35 to 39 3to 8 to 11 7.2 8.2 
B— |30to34|— 2to 2 4to 7 3.4 10.1 
C+ 25 00 20 — 8 to 3 Oto 3 1.9 15.8 
€ to 24 —77 to —9 | —42 to =1 0.8 58.7 


»Womemin-general estimated, 


VOCATIONAL Interests or Feuare Computes Proceamurns 35 


female and not in the male key is partially 
due to the less stringent criterion (12% 
rather than 15% difference between criterion 
and reference group) for including items in 
the female key. The correlation between scores 
of female programmers on the two keys is 
.60. The correlation among women in general 
would be substantially higher. Correcting for 
the larger variance to be expected for women 
in general increases the correlation to about 
70. The relationship between the two keys 
is quite high, but not high enough so that 
one may be substituted for the other or the 
results on the two accepted as equivalent. If 
` responses of women are obtained on the 
SVIB for men, their scores on the computer 
programmer key, expressed in terms of female 
norms, will be a meaningful measure of their 
interest in programming. Such scores, how- 
ever, cannot be interpreted in exactly the 
same way as scores on other SVIB keys for 
men (without female norms) or on the 
SVIB for women. 


Normative daia. Scoring weights for the 
computer programming key for women have 
been provided to the major SVIB scoring 
services. Standard-score and letter-grade 
norms for female programmers on both the 
men's and the women’s computer programmer 
keys are given in Table 3. The norms are 
based on combined criterion and validation 
groups. Standard scores based on female 
norms for the men’s key are 4 to 5 points 
higher than standard scores based on men’s 
norms, indicating that when the men’s SVIB 
is used with women, the resulting computer 
programmer scores will be about one letter 
grade too low unless they are converted 
by Table 3. 
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SELECTION OF SALESMEN BY MEANS OF AN 
ASSESSMENT CENTER 


DOUGLAS W. BRAY axo RICHARD J. CAMPBELL 
American Telephone and Telegraph Company, New York City 


Newly hired candidates for sales positions were evaluated by means of an 
assessment center consisting of paper-and-pencil tests, an interview, and 
individual and group simulations. Assessment staff judgments were compared 
with job performance some months later as evaluated by a special observa- 
tional team. Assessment results were strongly related to this criterion. Super- 
visors’ and trainers’ ratings were not significantly related to the job-performance 
criterion nor to assessment results. The findings lend support to recent studies 
indicating the efficacy of the assessment-center method in personnel selection. 


The assessment center (Taft, 1959) re- 
ceived its first industrial application when 
it was introduced into the Bell System in 
1956 as a major research method of the 
Management Progress Study (Bray, 1964). 
Two years later, in 1958, an assessment center 
was opened in the Michigan Bell Telephone 
Company for the appraisal of candidates for 
promotion to management from vocational oc- 
cupations. The results of this first operational 
assessment center were deemed so useful by 
line managers that the assessment-center 
method spread widely through the Bell Sys- 
em until over 50 assessment centers were 
established in the Bell System Companies 
processing approximately 8,000 men and 
women each year who are candidates for 
management positions. 

Follow-up investigations in the Manage- 
ment Progress Study (Bray & Grant, 1966) 
demonstrated considerable predictive power 
for the assessment-center method both for 
young noncollege management men who had 
emerged from the ranks at an early age and 
for college recruits assessed very shortly after 
employment. In addition, a study of 500 men 
recently appointed to management (Campbell 
& Bray, 1967), some of whom had not been 
assessed and some of whom had received 
various ratings at the assessment center, fur- 
ther demonstrated the usefulness of the 
method. 

The above assessment centers were designed 
to assess the ability to function successfully 
as a supervisory line manager. The present 
article describes the application of the assess- 
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ment-center method to a quite different prob- 
lem—the selection of prospective communica- 
tions consultants. The job of the communica- 
tions consultant is to work with business 
customers to determine their objectives, un- 
cover inconsistencies in methods of reaching 
these objectives, and suggest improved com- 
munications, where appropriate, for im- 
proved operations. 

A first step in devising an assessment 
center is a statement of the qualities relevant 
to success in the job for which assessment is 
planned. In order to arrive at this list of 
qualities, a psychologist experienced in the 
assessment-center approach due to his work 
on the Management Progress Study, David 
Berlew, spent several weeks interviewing 
sales personnel, observing sales training, and 
making numerous customer visits with ex- 
perienced communications consultants. When 
a preliminary list of qualities had been iso- 
lated, assessment techniques designed to re- 
veal these qualities were selected or developed. 
A group of experienced assessors, under the 
direction of the senior author, then applied 
these methods to a trial group of six successful 
salesmen. 

This pretest resulted in some redefinition of 
the assessment qualities and some revision 
of the assessment-center techniques. The list 
of qualities was reduced to 20 and included 
such characteristics as resistance to premature 
judgment, oral presentation, oral defense, be- 
havior flexibility, and persistence. The major 
assessment techniques were three simulations: 

Leaderless Group Discussion—This tech- 
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nique involves six assessees. They are told to 
assume that they are attending a meeting with 
five of their peers to discuss candidates for 
promotion from among their fictitious sub- 
ordinates. Each is given a description of a 
subordinate whom the assessee is to present 
as his candidate. Each assessee is given up to 
5 min. to present his man, followed by a 
1-hr. group discussion which must result in a 
rank ordering of the six candidates, 

Oral Fact-Finding Exercise—This is an 
individual technique in which the assessee is 
told that he is an arbitrator in a labor- 
management dispute. He is given a short out- 
line of the dispute and the opportunity to 
question a staff member to determine addi- 
tional facts and issues. He then presents his 
decision and its rationale orally to the staff 
member and is questioned intensively about it. 

Consulting Case—This is an individual 
technique in which the assessee is given 
written material—textual, graphic, and tab- 
ular—concerning the operation and problems 
of a business concern. His task is to study 
this material and prepare recommendations 
for the company. He presents his recom- 
mendations orally to two staff members who 
role play the two controlling partners of the 
enterprise and discusses his recommendations 
with them. He then prepares a final written 
recommendation. 

In addition to these simulations, the 
assessees completed a biographical data 
blank, underwent a lengthy interview, and 
took four tests, all cognitive in nature. These 
tests were the School and College Ability 
Test, a Critical Thinking Test, an Abstract 
Reasoning Test, and a Test of Knowledge of 
Contemporary Affairs. 

A trial of the assessment-center methods 
outlined above was conducted by setting up 
a Sales Assessment Center in Cleveland, Ohio 
to which some 14 participating Bell Telephone 
Companies sent candidates with little or no 
Bell System sales experience. This assessment 
center was staffed by a seven-man team, in- 
cluding a fourth-level director and six third- 
level sales managers from various telephone 
company sales departments. This staff re- 
ceived 3 weeks’ training by the authors and 
others before beginning the actual assessment 
of the trial Ss. 
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Assessees were processed by the assessment 
center in groups of six, and it required 2 full 
days for each group of six to undergo the 
various assessment techniques. Only one group 
of six was processed per week, however—the 
remaining days of each week being used by 
the assessment staff in preparing reports of 
the behavior of each candidate in each ex- 
ercise and for holding staff sessions to hear 
these reports for each candidate, to rate him 
on the 20 sales-assessment qualities, and to 
make a final recommendation as to his suit- 
ability as a communications consultant. 

In all, some 142 men from the 14 par- 
ticipating telephone companies were assessed, 
but various factors reduced the sample of men 
for the present report to 78. This sizable 
reduction was due to the fact that not all of 
the men sent to the assessment center met the 
standards which had been set up for the 
study. Some had had previous Bell System 
sales experience, others were on a special 
training program for college recruits, and some 
were not assigned to communications con- 
sultant work after assessment. In addition, a 
few resigned from the company before on-the- 
job criteria were collected. 

Great care was taken that the results of 
the man’s performance at the assessment 
center did not affect his assignment or ap- 
praisal on the job. All of the men in the study 
were Bell System employees at the time of 
their assessment and there was no feedback to 
their trainers, their supervisors, or to the 
men themselves on their performance at the 
assessment center. It was anticipated that if 
the assessment center proved a valuable 
screening method, it would be used as a pre- 
employment screen, but since the individual 
results of the trial assessments were not to 
be fed back to the supervisors immediately 
concerned with the man’s training or assign- 
ments, all Ss were employed. In fact, the 
participating companies were urged to keep 
as many as possible on the payroll so that 
criterion data could be collected after they 
had had sales training and were on the job 
for several months. 

Before proceeding to a discussion of the 
criterion against which the assessment judg- 
ments were compared, it may be of interest 
to examine the final judgments of the candi- 
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TABLE t 


AssessMent-Starr Jopoments OF ACCEPTABILITY 
ror Sates EMPLOYMENT 


* i | Percentage 
Assessment judgment | No. men of group 
More than acceptable | 9 12% 
Acceptable 32 41% 
Less than acceptable 16 20% 
Unacceptable 21 27% 
Total 78 100% 


dates made by the assessment staff. After the 
reports on each candidate were read at the 
weekly staff meeting and the 20 sales qualities 
rated on a 5-point scale, the assessment staff 
made a final judgment as to each candidate’s 
acceptability as a potential communications 
consultant. The 78 assessees were put into 
four groups—one group was called “more 
than acceptable,” another “acceptable,” a 
third group was considered “less than ac- 
ceptable,” and the final and lowest group was 
considered “unacceptable.” Table 1 shows 
the distribution of assessment-staff judgments. 
It will be seen that only 9 of the 78 men 
performed outstandingly at the assessment 
center while 21 men did very poorly. The top 
two groups—the acceptable groups—totaled 
41 men, or 53% of the sample, while 471% 
were considered unacceptable in varying de- 
grees. 

This finding itself is of interest. It indicates 
that even though the 78 assessees had met all 
employment standards and had been screened 
by their local companies as qualified for the 
job of communications consultant, experienced 
sales managers judged that from what they 
could see at the assessment center, only about 
half possessed good potential for their pro- 
posed job. This result is in keeping with 
other assessment findings in the Bell System, 
both of the college recruits of the late 1950s 
and the male and female candidates for man- 
agement in the various telephone companies. 
Assessment staffs characteristically reject one- 
half or more, usually more, of the candidates. 

The distribution of assessment-staff judg- 
ments was not significantly different for the 
64 men who were eliminated from the study. 
Forty-seven percent of that group “passed” 


the center as compared to 53% of those re- 
tained in the sample. 

The primary criterion against which as- 
sessment-center judgments were compared 
was first-hand observation of actual behavior 
in sales contacts 6 mo., on the average, after 
assessment. By that time each of the candi- 
dates had had sales training and had been 
working in the field for several months. This 
criterion had been decided upon before the 
assessment study was even begun. It was 
considered more pertinent than “sales results” 
which are influenced by many factors other 
than the behavior of the salesman. Company 
sales executives had, in any case, indicated 
that because of differences in market char- 
acteristics in the many areas across the United 
States to which these candidates were as- 
signed, no statistic or group of statistics would 
accurately reflect individual performance. 

The performance observations were carried 
out by a field-review team working out of 
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quarters in New York. This group’s regular 
work involved such field reviews, and it was 
an entirely experienced team that went out 
on these special criterion reviews. A reviewer 
accompanied each S on as many visits as 
necessary to determine whether he did or did 
not meet established standards in conducting 
his sales activities, These standards included 
preparation, usage prospecting, recommenda- 
tions, closing, and implementation. As a re- 
sult of the observations, each man’s perform- 
ance was classified as meeting standards or 
failing to meet standards. The reviewers were, 
of course, entirely unaware of any judgments 


TABLE 2 


- 
ASSESSMENT JUDGMENTS AND FEI D- 
PERFORMANCE RATINGS 


No. Percentage 


Assessment judgment No. meeting meeti ng 
men review review 

standards] standards 
More than acceptable 9 9 100% 
Acceptable 32 19 60% 
Less than acceptable 16 7 44% 
Unacceptable 21 2 10% 
Total 78 37 47% 


Note.—x? = 24.19; p < 001. 
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or observations made of the candidate at the 
assessment center. 

Of the 78 men, 37 men or 479% met review 
standards. Table 2 shows the relationship 
between the assessment-center judgments and 
whether or not the salesmen met the stand- 
ards. It will be seen that all of the 9 men 
deemed more than acceptable at the assess- 
ment center met review standards, while only 
2 of the 21 men deemed completely un- 
acceptable at the assessment center passed 
the field review. 

A supplementary investigation: looked 
into the ratings of the Ss given by their sales 
supervisors and by the trainers who had had 
each man in sales training. A preliminary 
letter was sent to each supervisor indicating 
that he would be asked to rate each salesman 
reporting to him. He was asked to prepare 
himself to do this and, when ready, to tele- 
phone either of a two-man team in the sales 
staff in A. T. XT. in New York. The New York 
staff member verified that he and the super- 
visor each had the same list of salesmen, 
and the supervisor was then asked to rank all 
of the men reporting to him in terms of their 
excellence as a communications consultant. 
After the ranking was recorded, the super- 
visor was then asked to rate each man on a 
graphic rating scale, the upper end of which 
was labeled “highest—I wish I had many 
more like him“ and the lower end labeled 
“lowest—He’s in the wrong job.” The super- 
visor was then asked whether he knew if any 
of the men had been assessed and, if he did, 
he was asked whether he knew how the man 
had come out at the assessment center. In 
most cases, the supervisors did not know the 


1This approach was devised by Ira Cisin of 
George Washington University. 


TABLE 3 
ASSESSMENT JUDGMENTS AND SUPERVISORS’ RATINGS 


Supervisors’ rating 
Higher 


43% 
53% 


i 
Assessment judgment | No. men 


Acceptable 41 
_ Less than acceptable 37 


Note.—Differences not statistically significant. 


3 
TABLE 4 
Serzevisoes’ Ranxos axo Finto 
Prsrosxaxce Rariwos 
| | Did not 
Supervisors 5 | Met review | meet pe- 
rating Moime | * candards | slew stan 
dards 
Higher | a | on | 9% 
Lower | 37 w | M 


Note.— Difieteaces not statistically squificant 


men had been assessed and in no case did the 

have any accurate information 
about the outcome of the man’s assessment. 
The same procedure was carried on with 
trainers, who rated and ranked all men in 
each training group in which an assessee had 
been trained. 

In order to maximize differentiation for 
purposes of comparison, the average rating 
given by the supervisors and by the trainers 
was determined. The study group was then 
divided at about the midpoint of each set 
of ratings. In the following tables, those men 
who scored below the midpoint are described 
as having received higher ratings and those 
who scored below the midpoint are described 
as having received lower ratings. It should be 
noted, however, that the ratings were biased 
in an upward direction, so some of the men 
counted as receiving lower ratings were not, 
in an absolute sense, rated below average. 

Table 3 shows the relationship between the 
assessment-center judgments and the super- 
visors’ ratings. It can be seen that there was 
not a strong relationship—57% of those who 
received acceptable assessment-center judg- 
ments received higher ratings by the super- 
visors, as compared to 47% of those who 
had received unacceptable assessment-center 
ratings. This difference is not statistically 
significant. 

These results immediately raised the ques- 
tion of to what extent the supervisor’s ratings 
were related to the field-performance ratings. 
Table 4 shows the results. It will be seen that 
there was very little or no relationship be- 
tween supervisory ratings and ratings of per- 
formance on the sales job by the field-review 
team. Such a result has, of course, consider- 
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TABLE å 
Nin Jopowzrt: axo Tesorees’ Rares 


Trainees” rating 
No men |—— — 
Higher | Lower 


l 


rn jadgment 


mi so | o% 
** * 


Acerptabie 
Lers than acceptable 


Note eee mot statisticaiiy 36 


TABLE 6 


Tux Ranxo axo Fretp-Pearoemance 
ATING 
ee — — U—v— —— 2 
Did not 


Ga 
i Met review meet re- 
No. men standards | view stand- 
ards 
4i o uo 
* on W% 


Higher 
Lower 


Note, —Diflerences sot mtatistically sanificant 


able import for questions of appraisal. Of 
considerable importance, 


pE 
j 
i 
i 


more of a suggestion that 
t be able to predict field per- 
Table 6 shows. Fifty-six per- 
men who received higher training 


1 
l 


+ 
f 
4 
E 
| 
: 


is, however, not stati: 
addition, 85% of the Ss had 
training ratings— 


J 


given 
not a very fine screen. 


between trainers’ ratings and field performance 


ship between trainers’ ratings and perform 
ance, and almost two-thirds of those who get 
lower ratings show good performance. It does 


tional and rating methods could be improved. 
It may be of interest to compare the 
degree of prediction obtained from the assess- 


assessment judgment with the field-perform- 
ance ratings. 

The biographical blank used at the assess- 
ment center was administered to give the 
staff members some advance information be- 


TABLE 7 


TRAINERS’ RATING AND FIELD-PERFORMANCE RATING 
(MEN JUDGED ACCEPTABLE OR BETTER AT 
ASSESSMENT CENTER ONLY) 


] 


Did not 
Trainers’ z Met review | meet re- 
rating Nore standards | view stan- 
dards 
Higher 21 71% 29% 
Lower 20 65% 35% 


Note.—Differences not Statistically significant. 
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TABLE 8 


CORRELATIONS OF ASSESSMENT PREDICTION AND 
PAPER-AND-PENCIL TESTS WITH 


Ss 


Assessment judgment | 51 
SCAT n 
8 Thinking Test 26 

mtemporary Affairs Test 28 
Abstract Reasoning Test .02 
Multiple R—Four Tests 33 


Setection op Sateawen 4t 
fore conducting interviews with the candi- formly shown that the method has mistastial 
dates. The blank was not scored. Because of validity, This bas bees tree whether the 
the small size of the sample, it is unwarranted amenmment staf consisted of 
at this time to analyze these biographical fessional piychologists oe — 2 or 
items or to attempt to combine by multiple management persoanei, The present ee 
correlational techniques all the various pieces demonstrates that as amesument center states 
of data gathered during the 2 days of amex by sales managers isstrocied is eee, 
ment. Such analyses can be done later when techniques cas afford a valuable ald is 
more cases accumulate and are now almost the selection of proupective saleumen. 
completed for a auexment 
center (Bray & Grant, 1966; Grant, Katkov- REFERENCES 
sky, & Bray, 1967). Bear, D. W. The progres 

Prychedogit, 1944, 19, 419-420 
Discussion Baar, D W. A Caur, D. L The ent orster 


The assessment-center method of evaluating 
potential for various occupations has had a 
spotty history (Taft, 1959). Conclusions 
about its effectiveness have been difficult to 


missoes! 
ministration, 1907, 3002), 6-1) 
Geaxt, D. L, Karsownry, W. & Bar, D, W. 
Contributions of projective aten. 


T 
(1951a, 1951b, 1953 
28 With tests as 


inventories have proved successful with several the only 
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Search scientists, Buel (1965) 
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.= University 


Among the many approaches followed in Space Administration (NASA) scientists, 
‘aylor, 


„ Ellison, and Tucker (1966) obtained 
of test validity coefficients in the .40s and .50s when 
as cri- biographical inventories were cross-validated 


. The former approach is against several criteria of creative achieve- 
by the research of Guilford (1959, ment. 
1967), — and Jackson (1962), and With only a few exceptions, investigations 


1962); the latter by that of Roe of creativity at the high school or college 


personnel in a 


with technical engineering 
personnel, and Chambers (1964) with eminent (NSF)-supported summer science program. 
psychologists and chemists. In a series of Thetbiographical inventory proved to be the 


with research of the bi 
company, Mc- NASA scientists to high school students par- 


) and MacKinnon level have employed tests as criteria, pre- 
4 criterion, not only dict or both. Dauw (1966) reports suc- 
— 


erentiation between highly creative 
and less creative adolescents by means of a 


As a predictor and as an — b biographical inventory, but his subjects (Ss) 
of 


the ere selected solely on the basis of creativity 
x test scores. Parloff and Datta (1965) com- 
in pre- pared contrasted groups of participants in 
in in- the Westinghouse Science Talent Search, 


educational contexts selected on the basis of judges’ ratings of their 


research projects. However, these groups were 
chiefly in personality test scores, 
items reported being 
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in¥entory developed on 
ticipating in a National Science Foundation 


Aeronautics and best overall predictor of the creative research 


Performance of these students, its validity 


of a larger project reaching 47 in one of the groups. In a 


of Mental Health and sub. follow-up of National Merit Scholarship 
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TABLE 1 


BIOGRAPHICAL INVENTORY SCORES OF CRITERION 
GROUPS IN Cross-VALIDATION SAMPLES 


| Creative-Scientific 


Creative-Artistic 


key* | key> 
Score ‘= 2 p 
Creative- | Control- | Creative- | Control- 
Artistic | Artistic | Scientific | Scientific 
Over 110 6 | o0 0 0 
101-110 6 1 0 | 0 
91-100 11 2 0 0 
81-90 10 4 1 0 
71-80 7 8 7 2 
61-70 8 2 15 3 
51-60 2 7 9 11 
41-50 0 9 5 9 
31-40 0 9 6 13 
21-30 0 3 4 10 
11-20 0 3 2 2 
1-10 0 2 1 0 
N 50 50 50 50 
M 88.76 52.38 53.50 41.62 
SD 19.58 24.32 17.69 13.91 
Range 52-140 3-110 4-84 17-72 
NCR 8.23* 3.74* 
his -64* 05" 


* Possible range —96 to 335. 
Possible range —82 to 239. 
* < 001, 


tests. For each of the 3,930 scorable items, classified 
as present or absent, a phi coefficient was computed 
against the dichotomous criterion of creative versus 
control. These coefficients were computed separately 
in artistic and scientific criterion groups. All items 
with phi coefficients at the P<.20 or better sig- 
nificance level were considered for inclusion in the 
initial CrA or CrS scoring keys. Some of these items 
were excluded because they duplicated other items, 
were checked by three or fewer Ss, or were incon- 
sistent with other Tesponses and hence were likely to 
have yielded isolated chance correlations, 

In the initial scoring keys, a weight of 1 was 
assigned to items discriminating between the 7 A 20 
and $< 05 levels, and a weight of 2 to items dis- 
criminating at the p C Os level or better. Items with 
higher frequencies in the creative group received posi- 
tive weights; those with higher frequencies in the 
control group received negative weights. The initial 
CrA and CrS scoring keys were used in scoring 
the biographical inventories of the corresponding 
creative and control Ss in the cross-validation 
samples. The scorers were unaware of the criterion 
status of the Ss. The scores thus obtained were cor- 
related with the dichotomous criterion to provide 
an estimate of the validity of the scoring keys. In 
order to utilize all the data in the selection of items 
for final keys, item analyses were carried out in- 
dependently in initial and cross-validation samples 
and those items were selected that differentiated 


between criterion and control groups with a com- 
pound probability of os or better (Baker, 1952; 
Katzell, 1951). 


RESULTS 


The cross-validation data for the initial 
keys are summarized in Table 1.* In both 
artistic and scientific groups, the mean scores 
of creative Ss excelled those of controls at the 
-001 level of significance, with normal curve 
ratios of 8.23 and 3.74, respectively. The 
point-biserial criterion correlations, whose 
significance is indicated by the above-normal 
curve ratios, were .64 and .35, respectively. 

Examination of the frequency distributions 
in Table 1 indicates that for both keys a 
cutoff score of 60 yields the maximum dif- 
ferentiation between creative and control 
groups. A simple index of differentiation for 
this purpose is the difference between cumula- 
tive percentage frequencies in creative and 
control groups. With a cutoff above 60, 96% 
of the CrA Ss in the cross-validation sample 
will be correctly classified, while 34% of the 
controls will be incorrectly classified as crea- 
tive. With the CrS key, 46% of the creative 
Ss will be correctly classified, while 10% of 
the controls will be incorrectly classified as 
creative. As proposed by Levy (1967), the 
“substantive significance” of a group differ- 
ence may also be evaluated in terms of the 
percentage of misclassification, defined as the 
percentage of individuals deviating from the 
mean of their own group more than halfway 
toward the mean of the other group. This 
percentage, which can be readily found from 
the point-biserial correlation, is 21 for the 
CrA key and 36 for the CrS key, as compared 
to a chance expectancy of 50. 

Once the validity of the initial keys (de- 
rived from one-half the available cases) was 
established, the final keys were constructed 
with items whose compound probability was 
based on data obtained from the total sample. 
The CrA key thus developed contains 140 
scorable items, while the CrS key contains 
65. Examination of the items in these final 


In order to avoid hegative scores in computa- 
tion, 70 was added to each score on the Artistic 
key and 40 to each score on the Scientific key. This 
adjustment, however, did not exclude negative scores 
from the total possible range, as shown in the 
Table 1 footnotes, 
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keys provides the most conclusive available 
description of the characteristics of the crea- 
tive students surveyed in this investigation; 
this description will be summarized in the 
next section. 


Discussion 


Identifying Creativity in Artistic versus Sci- 
entific Fields 


In all comparisons—including mean differ- 
ences, criterion correlations, extent of overlap 
of score distributions, and number of signifi- 
cantly differentiating items in initial and final 
keys—the CrA students were more sharply 
differentiated from their controls than were 
the CrS students. It should be recalled in this 
connection that, in both groups, creative and 
control Ss were equated in GPA and that 
CrS students significantly excelled CrA stu- 
dents in this regard. From many sources of 
evidence it appears that the CrS high school 
student resembles the academically superior 
student more closely than does the CrA stu- 
dent. Both the nature of scientific creativity 
itself and current educational practices in 
science courses suggest that high academic 
achievement—together with its characteristic 
antecedents and correlates—plays a predomi- 
nant role in scientific creativity. On the other 
hand, the traditional activities of artists and 
writers, as well as the content of courses in 
these areas, differ quite conspicuously from 
typical academic activities. While creative 
students in both artistic and scientific fields 
can be significantly differentiated from ac- 
ademically equated controls by means of a 
biographical inventory, this study strongly 
suggests that the former can be more readily 
differentiated than the latter. 


Correlates of Creativity across Both Fields 


An examination of items common to the 
final CrA and CrS keys provides a picture 
of the correlates of creativity in high school 
boys, regardless of field. In terms of specific 
scorable items, only 14 differentiated in the 
same direction at the .05 level or better in 
both final keys. This number is somewhat 
of an artifact, however, since there were many 
other differently worded but basically equiva- 
lent items in the two keys. Based upon both 


identical and equivalent items, the following 
description focuses upon the best established 
characteristics which were corroborated by 
several items. It might be added that even 
these common characteristics tended to be 
revealed in more items in the CrA than in the 
CrS group. 

The familial background of the creative 
students was not only academically superior, 
but the parents tended also to provide role 
models of interest and creative expression in 
the student’s field. College education was more 
frequent among parents of creative students, 
especially among mothers. Magazines regu- 
larly available at home were more likely to 
be of the political commentary-foreign affairs 
or the cultural-intellectual types. Creative 
students were less likely than controls to 
come from homes in which neither student 
nor parents play a musical instrument. With 
regard to specific field of interest, both fathers 
and mothers in the CrA group read more 
books than those in the control group, and 
reading was more often listed as the father’s 
favorite leisure-time activity. The fathers had 
more often won honors or awards in literary 
or art fields. The mothers had more often 
specialized in art or literary subjects in col- 
lege, had more creative hobbies, more fre- 
quently visited art museums or galleries, and 
were more often listed as the family member 
who had taught the student most about art. 
CrA students were also less likely than con- 
trols to come from families having no hobbies. 
The CrS students more often than the con- 
trols reported that science magazines were 
read regularly in the home and that their 
fathers had scientific hobbies. It is also rele- 
vant to note that in the CrS group one or 
more siblings were likely to have received 
national recognition for academic achieve- 
ments. 

In a number of ways, the creative student 
himself manifests a strong intellectual and 
“cultural” orientation. In both artistic and 
scientific groups, creatives more often than 
controls reported that they were considering 
an Ivy League college. In both groups, a sig- 
nificantly greater number of creatives reported 
that they read over 50 books a year. Other 
relevant items, occurring in either artistic or 
scientific groups, pertain to types of news- 
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papers and sections of newspapers read 
regularly, membership in academic and honor 
societies, and interest in plays, music, and 
art museums. Both creative groups, on the 
other hand, reported significantly less interest 
in sports than did their respective control 
groups, as evidenced by attendance at spec- 
tator sports, reading about sports, or even 
daydreaming about them. 

A third set of items can be characterized 
as indications of pervasive and continuing 
enthusiasm for one’s chosen field and single- 
ness of purpose in its pursuit. Early mani- 
festations of creativity are evident in several 
items, such as childhood creation of new 
games, mechanical or electronic objects, 
poems, stories, or art products. Persistence 
of interest in the same field is indicated in 
elementary school awards and in favorite and 
best subjects in elementary school. Creative 
students more often than controls received 
awards in their special field outside of school 
and they were more likely to own such field- 
related articles as microscopes, chemistry 
sets, and artists’ supplies. They also reported 
having become so absorbed in a project in 
their field that they skipped a meal or stayed 
up very late. They more often chose hobbies 
in their special fields, which were usually 
directly related to their vocational goals. 
Extracurricular activities, membership in out- 
of-school organizations, magazines read, and 
summer jobs all illustrate the pervasiveness 
of their interest. The creative adolescent evi- 
dently finds his chosen work so satisfying and 
refreshing that he is willing to devote his 
leisure time to it. He chooses an area that 
is personally and deeply valuable to him 
and is hence able to persevere at it. 

The creative student’s perseverance and 
goal orientation are not to be construed as 
narrowing influences, however, since they are 
likely to be accompanied by a characteristic 
breadth of interest. Thus creative students 
in both fields were more likely than their 
controls to have pursued a variety of hobbies 
(frequently since childhood), to have collec- 
tions of different sorts, to play muscial in- 
struments, to attend plays, and to participate 
in extracurricular activities. In such extra- 
curricular activities, moreover, they were more 
likely to have held offices and to have been 


elected to the highest offices, such as president 
or editor. Another example is provided by the 
CrA students who, when asked to check their 

favorite high school subjects, checked more 

subjects than did the controls. 

Finally, certain items suggest that creative 
students exhibit a stronger drive toward 
novelty and diversity in their activities than 
do controls, To some extent, the previously 
discussed breadth of interest may be regarded 
as a manifestation of this drive. But other 
illustrations can be cited. For example, crea- 
tive students reported more frequent prefer- 
ence than controls for unusual art forms, such 
as Pop, Op, and primitive art; they had more | 
unusual collections, such as road signs and 
spider webs; and they more often owned 
different kinds of records, such as classical 
and folk-song. Similarly, the tendency of crea- 
tive students in both fields to prefer friends 
both older and younger than themselves may 
indicate a desire for diversity in interpersonal 
relations. 


Differences between Creativity Correlates in 
Artistic and Scientific Groups | 


A direct comparison of the biographical | 
inventory responses of CrA and CrS students 
would be contaminated by differences in 
school and class distribution and GPA. In 
the present experimental design, each creative 
group was equated with its own control group 
in these variables. Hence the question now to 
be considered is whether the characteristics 
that differentiate the CrA students from their 
own controls differ in any systematic way 
from those that differentiate the CrS students 
from their controls. This question can be 
answered by comparing the two final biograph- 
ical inventory keys. 

The most conspicuous difference between 
the CrA and CrS keys occurs in a set of items 
indicative of the CrA student’s greater ex- 
posure to environmental diversity, through 
either personal or parental experiences. The 
parents of the CrA students were more likely 
to have been born in a different city, state, or 
country than the one in which the family was 
living. In the CrS group, the reverse relation 
was found, the parents being Jess likely than 
those in the control group to have been born 
in a foreign country and more likely to have 
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been born in the home city. CrA students 
were also more likely than controls to have 
moved from one state to another during their 
lifetime and to have visited more different 
parts of the country and more distant parts 
than the controls; they had also more often 
traveled out of the country—to Mexico, the 
Caribbean, and South America. 

Several items in the CrS key, give evidence 
of some conventionality in both parents and 
students, evidence that is certainly absent 
from the CrA keys. The CrS students were 
more likely than their controls to have fathers 
who majored in business subjects in college, 
who play bridge as a hobby, and from whom 
the students had learned about sports. The 
CrS students themselves participate in in- 
dividual-type high school sports such as tennis 
more than do the controls, although no 
significant group difference was found in 
team sports. In contrast, CrA students spend 
less time participating in sports of any kind 
than do their controls. These and other items 
also suggest a stronger sex-role identification 
among the CrS than among the Cr students, 
the former tending to enjoy closer relation- 
ship with the father, the latter with the 
mother. P 

With regard to childhood experiences, the 


Ers students reported less evidence of social 


participation in early childhood than did the 
controls. They more often stated, for example, 
that they had only one close friend and were 
never a leader of their childhood group ac- 
tivities under the age of 12. Although the 
picture of early childhood that emerges from 
the CrA key does not contrast with that of the 
CrS key, it nevertheless highlights a some- 
what different pattern of experiences and 
traits. Thus the CrA child was more likely 
than the controls to daydream and to have 
imaginary companions; his earliest childhood 
memory was more likely to date back to the 
age of 4 or earlier; and he had more unusual 
experiences, such as eidetic-type imagery. 
His relationships to his parents appear to 
have centered more often on shared interests 
in artistic or literary pursuits and less often 
on a warm personal relationship than was true 
of the control Ss. For example, the CrA 
student more often described his mother as 


enthusiastic and less often as understanding 
or sincere. He attended nursery school more 
often and did not describe family discipline 
as “always fair” as frequently as did the 
controls. If he had a personal problem, he 
was more likely to seek help from “several 
persons” than from his mother. 

A few noteworthy differences emerged with 
regard to school experiences, Preference for 
artistic and literary activities appeared in the 
CrA students’ reactions to their elementary- 
school curriculum, in questions pertaining to 
favorite subjects and subjects in which the 
students did best or poorest. In contrast, no 
items relating to the elementary-school cur- 
riculum differentiated CrS students from con- 
trols, these differences first appearing at the 
high school level. This finding may indicate 
that elementary-school instruction offers more 
creative opportunities in artistic and literary 
fields than in science, Both groups of crea- 
tive students gave some differentiating re- 
sponses to questions about outstanding teach- 
ers, but the focus of their replies differed. 
The CrA students reported having had two 
or more such teachers more often than did the 
controls. The CrS students differed from the 
controls in their more frequent listing of 
“wide knowledge” as the principal character- 
istic of such teachers and also in naming 
more than one trait to characterize them. 
It is also of some interest to note that CrS 
students tended to give more “uncertain” 
answers in the biographical inventory as a 
whole, a finding that may be related to the 
habits of caution fostered by scientific train- 
ing. * r 

In conclusion, creative students in artistic 
and scientific fields do share important com- 
mon features in experiential background and 
personal traits, but they also exhibit notable 
differences in both respects. Consequently, any 
investigation of creativity that fails to con- i 
sider area of creative achievement may pro- 

„duce a blurred or distorted picture of the 
creative person. Although the distinction be- 
tween scientific and artistic creativity is un- 
doubtedly a major one, further breakdown 
of artistic creativity into literary production 
and graphic art might be fruitfully explored 
in further research, : 


48 


Baxra, P. C. Combining tests of in 
cross-validation. Educational and 


Psychology, 1965, 49, 318-321. 


Cuamarss, J. A. Relating ality and biograph- 
ical factors to — eat. Psychological 
Monographs, 1964, 78(1, Whole No. 884). 

Davw, D. C. Life of original thinkers 
and good claboratots. Exceptional Children, 1966, 
32, 433-440. * * 

Gerzets, J. W. & Jacxson, P. W. Creativity and 
intelligence. New York: » 1962. 

Gunronn, J. P. faces of intellect. American 
Psychologist, 1959, 14, 469-479. 


21, 247-249. 

TELL, R. A. — off iliem gnalyss 
Educational and Psychological M „ 1951, 
11, 16-22. 


Levy, P. Substantive significance of significant dif- 
ferences between two groups. Psychological Bul- 
letin, 1967, 67, 37-40. 

jos, D. W. The nature and nurture of 
“creative talent. American Psychologist, 1962, 17, 


484-495. r 
McDeraap, C. D. Some correlates of creativity in 


R personnel. Journal of Applied Psy- 
„1965, 49, 14-19. 

Nicuots, R. C. & Hottawn, J. L. P. of the 

first year college performance of aptitude 


REFERENCES», 4), * 


„ ee and an 
biologists 


students, Monographs, 1963, 770 
Whole No. 570). 


biologists. 
Psychological Monographs, 1951, 65(14, Whole 
No, 331). (a) 


Ror, A. A psychological study of physical scien 
Genetic Psychology Monographs, 1951, 43, 121- 
235. (b) * s 

Ror, A. A psychological study of eminent psycholo- 
logists, and a comparison with 

Physical tists. Psychological 
Monographs, 1953, 67(2, ole No. 287). , 

Scuarrer, C. E. Biographical inventory correlates of 
scientific and creativity in adolescents. Un- 
published doctoral dissertation, Fordham Uni- 


versity, 1967! 0 
Sacre, W. J., L. E., GLENNON» J. R., & 
Owens, W. A. The 0 com- 
petence and creativity from nal history. 


Journal of Applied Psychology, 45, 59-62. 
Taytor, C. W., Coote, G. N., & 
Identifying high school students wi 


University of" Utah, 1963. 
(Mimeo) r 


te 
Tavtor, C. W., Extisox, R. I., & Tucxkz, M. 
Biographical information and the Prediction 
multiple criteria of success in science. Greensboro, 
N. C.: Richardson Foundati P 


(Received February 23, 1967) n 
$ * 
E * 
w > 3 
* s 
* 
+. » 
w 
* a 


se 


ad ee eae 


. 


3 i 5 f 
AUTOMOBILE ACCIDENTS AND DRIVER REACTION 


T 


P PATTERN ' . # 


— 


PAUL BABARIK* 


Accident Research Center, American Institules for Research, 3 D.C. 
2 è 


were 


» vehicle. However, drivers with such an atypical 
fewer accidents in their headways and t 
rates. Suggestions for further research and 


findings were discussed, 


> 
e e 1 ` * z > 
Motor-vehicle accidents are not a homo- 
geneous class of events with a common set 
of causes but heterogeneous events with 


multiple causes. In spite of this, much ac- 


., limited progress in acciden 


0 


“perceptual-motor va 


p 


research success. Althou 


cident research uses such criterion data as 
total number of accidents during the previous 
S.yr., or overall accident and violation rates. 
This practice may have contributed to the 
h. Focus- 
ing on one normal aspect of driving such as 
stopplte, | one kind ident, the struck- 
from-behind accident, should result in greater 
riving is a per- 
ceptual-motor skill, few relationships between 
iables and accident cri- 
terion ‘data have been found (e.g., Goldstein, 
1962; Lauer, 1959). The abandonment of 
perceptual-motor variables does not neces- 
sarily follow from these findings. Why not 
consider perceptual-motor variable patterns 
or combinations and relate these to the be- 
havioral patterns of drivers that might lead 
to accidents? Drake (1940) considered per- 
ceptual-motor tests in combination and 
found that individuals who. performed faster 
on motor tests than perceptual tests had more 
industrial accidents than individuals whose 
perceptual speed was faster than their motor 
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Although the existence of either a general 
perceptual speed factor or of a general motor 
speed factor has been d (eg. Fleisch- 
man, 1953, 1954), the possibility that a 
refined pattern of component perceptual- 
motor factors should be associated with 
specific driving acts, if not 
success, has to be considered as an alternative. 
Of course, such hypothesized predictor vari- 
ables must be related to more refined driving 
measures than overall accident rate. Accord- 
ingly, the primary aim of this study was to 
determine the relationship between one pat- 
tern of perceptual-motor behavior one 
type of accident. The perceptual-motor be- 
havior considered was the ratio of simple 
reaction time (RT) to jump reaction time 
(JRT) and the type of accident was the rear- 
end collision or the struck-from-behind ac- 
cident. 

The 


struck-from-behind accident was 


chosen as the criterion variable not only be- 


cause of the common-sense likelihood of a 
link with reaction pattern, but also because 
this kind of accident is numerous enough to 
yield sufficient cases for statistical analysis; 
for example, in one recent state study this 
class of accident accounted for 25% of all 
highway accidents, and what is of even greater 
significance, 30% of all injuries (State of 
New York, 1964). In freeway driving, where 
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there are no intersections and all trafic is 
moving in the same direction, more than 50% 
of the accidents fall into this category, 

In the present study, response time on 
reaction-time tests had been fractionatéd into 
initiation time and movement time. Initiation 
time is the time required to begin the move- 
ment in response to the stimulus. The time 
to complete the perception reaction as rep- 
resented in thé jump reaction time (JRT) 
test is made up of initiation time plus move- 
ment time.’ 

In the present study, it was hypothesized 
that drivers whose initiation time is an ab- 
normally long portion of their total percep- 
tion reaction (i e., drivers who have a high 
ratio of RT to JRT) would have a higher 
percentage of accidents in which they were 
struck from behind than would drivers whose 
perceptual-motor RT shows a typical ratio, 
Why should there be any difference in the 
stopping characteristics of two drivers, both 
of whom take .60 sec, for their total percep- 
tion reaction, but who differ in the length of 
‘the initiation time and movement com- 
ponents? Why should there be a difference 
nyt: tie A who typically takes .15 sec. 
to ini the response and .45 sec. for move- 
ment, thus has a reaction-pattern Score of 


20 or 28, and Driver B who takes 30 sec 
for movement, thus has a reaction-pattern 
dot Sor A vehicle's stopping is 


score of 60 
partly a function of the force that is applied 
by the driver, and force equals Mass X Ac- 
celeration (F = MA). Therefore, it was hy- 
pothesized that the driver who makes up 
for long initiation time with fast movement, 
that is, has short movement time, has greater 
limb acceleration and therefore applies greater 
force and stops his vehicle in fewer feet after 
contact with his brake pedal is made and 
his brake lights come on. Actually with such 
compensating limb speed, not only is the ac- 
The author is indebted to A. Zavala for this 
information on the RT and JRT tests. These com- 
ponents have been identified in factorial studies 
(eg, Fleishman, 1954) as the reaction-time factor 
(RT) and the speed of limb movement factor 
(SLM). Reference to these components as initiation 
time and movement time is consistent with both 
factorial findings and logical analysis, 
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celeration increased, but the mass is increased 
as additional g forces enter into braking to 
make the stop even more abrupt and un- 
repeatable by a following driver. As total 
reaction time is transformed into reaction dis- 
tance, drivers with this reaction pattern typi- 
cally stop in less road distance or within the ö 
expected . mie of the following driver. By 
systemat leaving less distance between | 
themselves and following drivers, such drivers 
increase their probability of being struck 
from behind, especially when such hazards — 
as tailgating and slippery roads exist. In 
effect, long initiation time followed by com- 
pensating short movement time probably 
desynchronizes the traffic flow or constitutes | 
a desynchronizing reaction pattern. 

The struck-from-behind accident is a " 
prima-facie extreme instance of such a re- 
action pattern. While the principal hypothesis 
was that drivers with a desynchronizing re- 
action pattern will have a higher incidence of 
struck-from-behind, accidents than drivers 
with normal reaction patterns, if the atypical 
drivers stop short, the corollary hypothesis 
that they should collide with vehicles in their 
headway less frequently would, also follow 
and was also tested. Although struck-from- 
behind accidents constitute a large proportion 
of accidents, the majority of accidents still 
occur in one’s headway. Therefore, it would 
follow that drivers with a desynchronizing 
reaction pattern would have à lower overall 
accident rate. 

The question might well be raised as to 
why the probability of a struck-from-behind 
accident is not primarily a function of a 
simple perceptual-motor factor movement 
time (as measured by IRT — RT), rather 
than as a function of a less parsimonious 
and more tricky ratio, the desynchronizing 
reaction pattern (as measured by RT/JRT). 
First, it is not more parsimonious to use 
movement time, as movement time is also a 
derived score, obtained by subtracting RT 
from JRT. Second, to use movement time 
as a predictor of the accident criterion would 
be to ignore previous findings of no relation- 
ships between movement time and accidents 
(Goldstein, 1962). Third, as noted above, 
Drake (1940) found a relationship between 
perceptual-motor ratio and accident proneness 
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laboratory JRT only in a forced or emergency 
stop. The driver whose initiation 
usually longer will have to utilize his shortest 
movement time more often than the 

with, faster initiation time. While it is gran 
that movement time determines the 
distance, other things being equal, it seems 
reasonable to assume that short movement 
time will increase the probabilities of struck- 
from-behind accidents only when it follows a 
long initiation time that necessitates a 


would not correlate with accident incidence. 
Since measures of RT and JRT were to be 
obtained in computing the desynchronizing- 
reaction-pattern and the movement-time fac- 
tors, possible relationships between these two 
variables and struck-from-behind accidents 
could also be examined. Following Goldstein 
(1962) it was also hypothesized ad hoc that 
no relationship between the RT and JRT 
tests and accident incidence exists. 


METHOD 


The data upon which this study was based were 
obtained from 127 Washington, D. C. taxicab 
drivers. The study began with 160 Ss. However, 
only the 127 experienced cab drivers—those with 
2 or more yr, full-time hacking or equivalent (part- 
time hacking was considered as one-half time)— 
were retained, The Ss averaged 48 yr. of age. Each 
S was asked for the number of accidents he had 


had in the past 5 yr. He was then asked to de- 


scribe each accident. Drivers were first classified 
according to their overall accident rates into three 
“groups: (a) O accidents or accident free, (b) 1 ac- 
cident, and (c) 2-9 or multiple accidents, Then 
those drivers who reported one or more accidents 
were classified according to kind of accident— 
struck-from-behind and other-than-struck-from-be- 
hind (Other), or accidents in their tailway and 
accidents in their headway. z 

The drivers were administered two kinds of RT 
tests. One was simple reaction time (RT) where 
S’s finger rested on a microswitch which was de- 
pressed in response to a light signal. The second 
test was jump reaction time (JRT) where S 
had to move his hand 12 in. from a rest position to 
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SrrucK-FROM-BEHTND (SFB) 


ACCIDENTS 
Total 
no. 
k — 3 
No. SFB accidents : 
0 (only Other) accidents 46 
1 accident 40 


2-8 multiple 
Total 


Note. = 9.19; p < .02. 
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TABLE 2 


COMPARISON OF TAXICAB Drivers wir A Desyn- 
CHRONIZING REACTION PATTERN (DRP) axb 
NORMAL REACTION PATTERNS WITH RESPECT 
TO MULTIPLE STRUCK-FROM-BERIND 
(SFB) AND MULTIPLE OTHER 


ACCIDENTS 
Reaction 
pattern Total 
no, 
Normal | DRP 
Multiple (2-8) SFB accidents 8 10 18 
Multiple (2-8) Other 
accidents 32 6 38 
Total multiple-accident 
drivers 40 16 56 


Note.—x? = 9.86; p <.001. 


pared with 10% of the 76 drivers with a 
normal reaction pattern. 

While a single accident may be a chance 
event, multiple accidents of the same type 
Present unusually strong support for the 
Operation of a systematic variable and causal 
link. So, drivers with two or more accidents 
were compared further. The double-edged 
effect of the atypical reaction pattern is shown 
in Table 2. Only 6 out of 16 (or 38%) of the 
multiple-accident DRP group had multiple 
Other accidents, whereas 32 (80%) of the 40 
drivers with a normal reaction pattern had 
their accidents “up front.” The corollary hy- 
pothesis that drivers with the desynchronizing 


TABLE 3 


Comparison oF TAXICAB DRIVERS WITH A Desyn- 
CHRONIZING REACTION PATTERN (DRP) AN 
Norwat REACTION PATTERNS WITH 
RESPECT ro OVERALL 


ACCIDENTS 
Reaction pattern 
No. accidents FTF A 
Normal DRP 
0 11 12 23 
if 32 9 41 
2-9 44 19 63 
Total 87 40 127 


Note.—x? = 6.35; p < 05. 


reaction pattern have fewer Other accidents 
was thus supported. 

Having considered the relationships of the 
reaction pattern to accidents in one’s tailway 
and also in one’s headway, the safe driver— 
one who had neither class of accidents or was 
accident-free—was examined next. In Table 3 
the accident-free drivers were compared with 
drivers who had either headway or tailway 
accidents. Thirty percent of the 40 drivers 
with a desynchronizing reaction pattern were 
found to be accident free compared to only 
13% out of the 87 drivers with normal re- 
action patterns. When the number of ac- 
cidents was examined, the Normal group re- 
ported 2.01 accidents in the preceding 5 yr., 
whereas the DRP group reported 1.65 ac- 
cidents over the same period. On two-way 
roads with cross traffic the majority of ac- 
cidents occur in one’s headway. The Other 
accident rate reflects this fact. However, in 
one-way driving without intersections the 
struck-from-behind accident is often the pre- 
vailing kind of accident. When most driving 
occurs under such circumstances, then the 
driver with the desynchronizing reaction 
pattern may well have the highest overall 
accident rate and no longer be the safer 
driver. 

Only the desynchronizing-reaction-pattern 
Score accounted for accident variance when 
chi-square contingency tables were used. 
When product-moment correlation was util- 
ized to compare the four possible variables, 
no relationship was found between the single 


TABLE 4 


CoRRELATION OF REACTION-TIME VARIABLES AND 
STRUCK-FROM-BEHIND (SFB) 


ACCIDENTS 
e 
Initiation time (IT) and no, SFB accidents 13 
Jump reaction time (JRT) and no. SFB 
accidents 05 


Movement time (JRT — IT) and no. SFB 


accidents 223% 
977 i RT 
Desynchronizing reaction pattern JRT and 
no. SFB accidents 208" 


2 <.05. 
> < 101. 
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perceptual-motor scores of RT and JRT and 
accident incidence. Contrary to hypothesis, 
movement time was significantly correlated 
with struck-from-behind accidents. However, 
the finding that the correlation of movement 
time was lower than the correlation of de- 
synchronizing reaction pattern with struck- 
from-behind accidents could be taken as sup- 
port for the general thesis that while the 
speed of movement time determines the stop, 
the stop becomes more hazardous when move- 
ment time is combined with a slow initiation 
time as represented by the desynchronizing 
reaction pattern. 


DISCUSSION 


The finding of an empirical relationship, 
if confirmed by further studies, has practical 
significance. However, considerable research 
is called for before it would be clear how 
this finding would pay off. The crux of such 
research would be to test the assumption 
underlying the desynchronizing reaction pat- 
tern, that is, that the DRP driver’s percep- 
tion reaction in stopping results in system- 
atically less space between him and the fol- 
lowing car. The arrangement used in re- 
search on car following (Gazis, Herman, & 
Rothery, 1961) could measure the hy- 
pothesized differential space relationship re- 
sulting from the desynchronizing reaction 
pattern and other variables considered in this 
present study. If the hypothesized hazard is 
confirmed and specified quantitatively, fur- 
ther research to determine whether the de- 
synchronizing reaction pattern is relatively 
fixed or immutable would be called for. 
Implications along the following lines might 
follow if the desynchronizing reaction pattern 
were found to be relatively stable: 

(a) Selection and driver testing. Taxicab 
drivers, chauffeurs, patrolmen, and com- 
mercial drivers could have a test of their re- 
action pattern included in their selection tests 
and the results put on record. 

(b) Training. Follow-up counseling and 
instruction procedures could be tried and, if 
found effective, incorporated into driving 
training in general. Specific counseling that 
one is prone to a specific kind of accident, 
in this case to being struck from behind, 
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might prove more effective than general warn- 
ings to be careful. 

(c) Legal. This measure could constitute 
evidence for assigning a greater portion of the 
costs and culpability to a leading driver, 
This might not only be more equitable than 
the present tendency to charge the following 
driver, but also provide an incentive to the 
antecedent driver to compensate for his de- 
synchronizing reaction pattern. Such a driver 
might also have a warning fixed to the rear 
end of his car after he had a struck-from- 
behind accident to caution following drivers. 

(d) Human engineering. Tf the perceptual- 
motor regularity studied here is established 
as a variable in accidents, changes in the 
braking system might be called for. These 
might even take such forms as dampening 
the application of braking force for some 
drivers, more effective signaling of stopping 
to the following driver, and perhaps adjusting 
the braking system to the driver as one ad- 
justs the seats. 

If this pattern is mutable, then changing 
the desynchronizing reaction pattern would 
be called for. The most obvious way would 
be to reduce the initiation-time factor and 
thus bring the perception-reaction pattern 
into line with typical drivers. This would 
assume that the pattern has a large attitudinal 
rather than psychophysiological basis. One 
possible and feasible way of altering the re- 
action pattern is to slow down the movement- 
time component so that it is no longer com- 
pensatingly fast. 

A slowing of movement to decrease a 
hazard and probability of accident seems in- 
credible. Would not a reduction of accidents 
in one’s tailway be at the expense of more 
accidents in one’s headway? A post hoc ex- 
amination of the 40 DRP drivers brought to 
light 6 men who had both very long RT 
and long JRT; that iš, their movement time, 
while short in relation to their initiation time, 
was longer than the sample mean or not fast 
in absolute terms. It was found that this 
group had a mean struck-from-behind ac- 
cident rate of .33 (total sample mean = .74), 
with only one driver reporting any struck- 
from-behind accidents; he reported two. This 
group reported a total of five accidents, or a 
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mean rate of .83 (total sample mean = 1.83). 
Three of these six drivers were accident free. 
While no conclusions are warranted on the 
basis of this small group alone, the hypoth- 
esis, that a slowness in initiating a response 
without compensating limb speed is associated 
with the below-average accident rate, calls 
for further investigation. It may be that 
these drivers learned to live with their slow 
reaction pattern. If subsequent studies were 
to confirm that such a reaction pattern does 
not lead to more headway accidents, and 
results in a lower rate of tailway accidents, in 
a roundabout way an aspect of safe driving 
would be identified. 
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FACULTY SATISFACTION AND THE DEAN’S INFLUENCE: 
AN ORGANIZATIONAL STUDY OF TWELVE LIBERAL ARTS COLLEGES ' 


JERALD G. BACHMAN 
Institute for Social Research, University of Michigan 


Questionnaire data were obtained from 685 faculty members in 12 liberal 
arts colleges. Measures included amounts of administrative influence, bases of 
influence for the dean and faculty, and levels of faculty satisfaction. Faculty 
satisfaction was greatest in those colleges having the highest total influence 
across all levels of the academic hierarchy. Faculty also indicated greatest 
satisfaction with deans who have relatively high influence over college affairs, 
and whose influence is based upon expertise and respect rather than upon 
legitimate authority and coercion. The results are discussed in the light of 
similar findings in business organizations. 


The research reported in this paper has two 
basic purposes. The first is to provide some 
tentative, but hopefully useful, guidelines for 
college administrators. A second and broader 
purpose is to explore the parallels between 
administrative processes in higher education 
and those in other kinds of organizations, 
especially industrial ones. Studies of this sort 
should guide our efforts to apply the results 
of organizational research to the academic 
setting. And perhaps more important, they 
should tell us something about the possibilities 
for developing general theories of administra- 
tion, applicable to business, governmental, 
political, voluntary, and academic organiza- 
tions. The present paper, which focuses on 
the role of the academic dean, is the first of 
several reports based on this research in 
liberal arts colleges. 

In many respects, the academic dean is the 
faculty’s most important voice in the adminis- 
tration of the liberal arts college. He repre- 
sents the faculty viewpoint to the president 


1The research reported here was supported by 
grants from the Carnegie Corporation of New York 
to the Survey Research Center and the Center for 
Study of Higher Education, The University of 
Michigan. 

Data for the present analysis are taken from a 
larger survey carried out by A. L. Fisch and the 
author. Special thanks are due to the many faculty 
members who devoted their time and effort to 
provide the information for this study. 

Thanks are also due to the following colleagues 
who read an earlier draft and offered suggestions: 
Robert Cope, Glenn Jones, Arnold Tannenbaum, 
George Wieland, and Alvin Zander. 
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and trustees, and he also represents higher 
administration to the faculty. His role is thus 
similar to many middle-management positions 
in business organizations. Given this set of 
dual loyalties and commitments, what are 
some of the qualities of an effective dean? 
If for the moment we are guided by research 
findings in nonacademic organizations, the 
following hypotheses may be considered: 


1. The effective dean is influential. A study 
in industry by Pelz (1951, 1952) indicated 
that a supervisor’s effectiveness in dealing with 
his subordinates depends in part upon his 
influence with his own superiors; the in- 
fluential supervisor can “deliver the goods” 
for his men. A recent study of account execu- 
tives engaged in the sale of intangibles 
(Bachman, Smith, & Slesinger, 1966) pro- 
duced similar results; satisfaction and per- 
formance tended to be higher when the office 
manager was perceived as highly influential 
over “how the office is run.” To the extent 
that these findings are applicable to an aca- 
demic situation, they imply that the “strong” 
dean will be effective. 

Tt does not follow that if the dean is 
highly influential then someone else in the 
college administration will be proportionately 
less influential. Recent studies suggest that 
the total amount of influence (or control) in 
an organization is best treated as a variable, 
not a fixed value; indeed, a relatively high 
level of control across all organizational eche- 
lons is associated with high performance and 
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satisfaction (Tannenbaum, 1962). Tannen- 
baum (1961) and Likert (1961) have argued 
that this pattern of “high total control” is 
successful because it involves members at all 
levels of the organization, leading to more ef- 
fective decisions and also to higher motivation. 
This suggests a supplementary hypothesis: the 
effective college administration is one with 
relatively high influence at all levels. 

2. The effective dean’s influence is based 
upon a relatively high degree of competence 
and personal attractiveness, and a relatively 
low dependence upon his “legitimate au- 
thority” and coercion. Five different “bases of 
power,” originally distinguished by French 
and Raven (1960), were examined by Bach- 
man et al. (1966). Satisfaction and to some 
extent performance were positively related to 
“expert” power (based on competence and 
experience) and “referent” power (based on 
personal admiration); negative correlations 
occurred with “legitimate” power (based on 
hierarchical “rights”) and “reward” and 
“coercive” power (based on the potential use 
of positive and negative sanctions). 

The above hypotheses are stated in very 
broad terms. For example, they deal with the 
dean’s “effectiveness” as if it were a unitary 
dimension. In fact, of course, there may be 
many criteria for evaluating performance in 
the role of dean. The present research is 
necessarily limited to studying the dean’s 
effectiveness in terms of his impact on the 
faculty; specifically, the criteria will be 
several measures of faculty satisfaction, in- 
cluding their view of how well the dean ful- 
fills his responsibilities. 


METHOD 
Research Sites 


The respondents selected for study consisted of 
all full-time faculty members at each of 12 liberal 
arts colleges belonging to a regional association of 
colleges. This choice provided a number of institu- 
tions which are somewhat homogeneous with respect 
to size, geographic location, and educational reputa- 
tion. Since our interest is concentrated upon the 
effects and correlates of certain administrative fac- 
tors, it seemed desirable to select colleges which 
would be fairly similar in other characteristics. A 
second and more practical reason for the choice was 
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the willingness of member colleges to participate 
in the study.? 

At the time of the survey (April-May 1964), 
faculty size at the 12 colleges ranged from 67 to 
173, with a median of 92 and a mean of 101. 
A total of 1,210 questionnaires were sent to eligible 
faculty members; after several months, during which 
time two follow-up letters and a duplicate question- 
naire were sent to each nonrespondent, a total of 
685 were returned. While a higher response rate 
would have been preferred, of course, the final re- 
sponse (approximately 60% in each of the 12 col- 
leges) was consistent with the author’s expectations 
for a long (19-page) mail questionnaire administered 
near the end of the academic year.? 


Measures 


All data used in the study consist of faculty 
responses to questionnaire items. Only a small pro- 
portion of the total questionnaire material has 
been included in the present analysis. 

Influence. The following question was used to 
assess influence over college matters: “In general, 
how much influence does each of the following 
groups or persons actually have in determining the 
policies and actions of your colleges?” Response 
categories ranged on a 5-point scale from (1) 
“Little or none,” to (5) “A great deal.” The respond- 
ents were asked to rate the amount of influence of 
each of the following groups or persons: the board 
of trustees, the president, the academic dean, the 
department chairmen (all of them), the faculty as 
a group, and the students as a group. The present 
study uses two measures based on this item: a 
measure of “total influence” (as indicated by the 
mean of all six influence ratings), and the in- 
fluence of the academic deans (based on the rating 
of the dean alone). 

Bases of influence. The following questionnaire 
item was used to assess five bases of the dean’s 
influence over the respondent (referent, expert, 
reward, coercive, and legitimate influence, in that 
order) : 


Listed below are five reasons generally given 
by people when they are asked why they do the 
things their superiors suggest or want them to do. 


The problem of gaining access to a research 
site and obtaining the cooperation of prospective 
respondents is crucial in research of this sort. Special 
thanks are due to Algo D. Henderson and Eldon L. 
Johnson, of the Center for Study of Higher Educa- 
tion, for their help in assuring the full cooperation 
of each college in the association. 

It must be stated clearly that the purpose of the 
research based upon this sample of college faculty 
is not to describe “the average faculty member.” 
Rather, the primary intent is to discover relation- 
ships among variables; such relationships are likely 
to appear even if the set of respondents departs 
somewhat from an accurate probability sample 
drawn from the universe of faculty members. 
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Please read all five carefully. Then number them 
according to their importance to you as reasons 
for doing the things your academic dean suggests 
or wants you to do. Give rank “1” to the most 
important factor, “2” to the next, ete. 

A. I respect him personally, and want to act 
in a way that merits his respect and admiration. 

B. I respect his competence and judgment about 
things with which he is more experienced than I. 

C. He can give special help and benefits to those 
who cooperate with him. 

D. He can apply pressure or penalize those who 
do not cooperate. 

E. He has a legitimate right, considering his 
position, to expect that his suggestions will be 
carried out. 


An additional item was included on an exploratory 
basis to measure faculty perceptions of their in- 
fluence over the dean. The respondents were given a 
list of alternatives similar to those listed above, 
but in this case they were asked to “. , number 
them according to what you think is their importance 
to your academic dean as reasons for doing the 
things you suggest or request of him.” 

Because of the ranking procedure involved, the 
five bases of influence are not independent; in a 
sense, any single basis of influence can be given 
prominence only at the expense of the other bases. 
On the other hand, the ranking procedure has the 
advantage of forcing the respondent to discriminate 
among all the bases, rather than giving prominence 
to only one or two. Moreover, it avoids a common 
contaminating tendency to rate the amount of in- 
fluence rather than the nature or bases of influence. 

Faculty satisfaction. Three measures of satisfaction 
were used; two deal with overall “job satisfaction,” 
and the third specifies satisfaction with the dean. 

In order to obtain some indication of commitment 
or attachment to present college positions, each 
respondent was asked to indicate (a) his first choice 
of another college or university and (b) the mini- 
mum salary condition that would induce him to 
leave his present position to accept a similar one at 
the other college or university. Response alternatives 
covered a 7-point scale ranging from (0) “I would 
accept even if it involved about a 20% salary 
reduction,” to (6) “I would not accept such a posi- 
tion, no matter how great a salary increase might 
be involved.” 

A second measure of satisfaction simply pre- 
sented the respondent with the following state- 
ment, and asked him to indicate his agreement or 
disagreement on a 5-point scale: “While no job 
can be expected to be perfect, there are really very 
few things that I would change about mine if I 


had the power to do so.” 


The measure of satisfaction with the dean also 
presented a statement, and asked the respondent to 
indicate agreement or disagreement on a S point 
scale: “All things considered, I am personally quite 
satisfied with the way the academic dean of my 
college fulfills his responsibilities.” 
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TABLE 1 


COLLEGE ADMINISTRATIVE CHARACTERISTIC’ AND 
Faculty CRITERIA: MEANS AND 
Sranpaxp Deviations 


| a SD among| 5. 
rating | individuals = 
1 college Ms 
(N = 685) | (N = 685) | ZN = 12) 
Influence 
By dean 3.53 1.00 0.51 
Total influence 349 OAT 0.14 
Bases of dean's influence 
Legitimate 2.40 1.44 9.20 
Expert | 1.948 1.15 0.38 
Referent 248 1.09 0.19 
Reward 3.67 0.96 0.16 
Coercive 4.40 1.07 0,26 
Bases of faculty influence Í 
Legitimate 1.72 0.91 1 0.14 
Expert 1.93 | 0.92 0.11 
Referent 2.85 90.91 0.17 
Reward 366 | 0.86 0.16 
Coercive 4.78 | 063 0.13 
Faculty satisfaction | 
Minimum conditions | 
for leaving 3.53 | 1.66 0.41 
With job as a whole 3.29 tal 0.19 
With dean of college 3:56 | 113 0.45 


Clearly, these several measures do not relate to a 
simple unitary dimension of satisfaction. It is quite 
possible, for example, to find a faculty member who 
would rate his present position favorably, and yet 
be eager to go to a large and prestigious university— 
even at a slight reduction in salary. And that same 
respondent could be either satisfied or dissatisfied 
with the way the dean fulfills his responsibilities. 
Nevertheless, moderate positive correlations appeared 
among the three measures. The product-moment 
correlation between the two measures of overall 
satisfaction is .35; satisfaction with the dean cor- 
relates .20 with attachment to present position, and 
34 with the second measure of overall satisfaction 
(in each case p < 01, two-tailed test). 


RESULTS 


The major findings of the study are sum- 
marized in Tables 1, 2, and 3. Table 1 pre- 
sents mean responses and standard deviations 
for all variables. Tables 2 and 3 present cor- 
relations between college administrative char- 
acteristics and faculty criterion ratings at 
two levels of analysis (described below). 
Before presenting the substantive findings, it 
will be helpful to outline the procedures and 
rationale for data analysis. 


Statistical Analysis and Rationale 


The analytic approach used in this study 
closely parallels that used by Bachman et al. 
(1966). This approach assumes that the best 
available measure of an “organizational char- 
acteristic” is often some composite of percep- 
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TABLE 2 


COLLEGE ADMINISTRATIVE CHARACTERISTICS CORRE- 
LATED WITH MEAN FACULTY 
CRITERION RATINGS 


| College mean faculty criterion 


ratings 
College administrative 
characteristic Minimum | Satisfac- | Satisfac- 
condi- tion with | tion with 
tions for | job asa dean of 
leaving whole college 
Influence 
By dean -16 —.07 .69* 
Total influence 42 10 74 
Bases of dean's influence 
Legitimate —.24 —.82 
Expert 38 75 
Referent 40 67 
Reward =.31 —.80** 
Coercive —.44 —.70* 
Bases of faculty influence 
Legitimate —.10 42 AT 
Ex 44 .61* 62⁷ 
Referent 23 11 39 
Reward —.40 —.75** —.75** 
Coercive —.05 —.24 =T 


Note.—N = 12 colleges. K 
* p <.05, two-tailed, product-moment correlation. 
** p < 01, two-tailed, product-moment correlation. 


tions by a number of respondents in the 
organization. Thus in the present study the 
measure of a college administrative char- 
acteristic always consists of the mean of all 
faculty perceptions. Each characteristic is 
correlated with faculty satisfaction measures 
at two distinct levels of analysis: college mean 
criterion scores and individual criterion scores. 

College mean criterion scores. The first 
level of analysis is the college as a whole; 
Table 2 presents correlations computed be- 
tween administrative characteristics and mean 
faculty satisfaction scores (V= 12 colleges). 
An individual faculty member’s satisfactions 
with his position and dean are presumably 
influenced by (a) certain college-level ef- 
fects—administrative characteristics which 
are common to most of his campus colleagues, 
and also (b) a host of idiosyncratic factors 
which differ from person to person. The use 
of mean satisfaction scores largely cancels the 
effects of personal factors (b), and thus high- 
lights the impact of college-level effects (a). 

Individual criterion scores. The second level 
of analysis is the individual faculty member; 
Table 3 presents correlations computed be- 
tween administrative characteristics (still 
based on mean perceptions) and individual 
faculty satisfaction ratings (V= 685 faculty 
members). This form of analysis deals with 
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the question: Given the many causes of any 
individual’s satisfactions, what is the relative 
importance of certain college administrative 
characteristics? It is clear at a glance that 
the correlations in Table 3 are, on the whole, 
much smaller than those in Table 2. In 
Table 2 college-level effects have a “mo- 
nopoly” in their impact upon satisfaction, 
since individual differences are held more-or- 
less constant through the use of college mean 
satisfaction data. In Table 3 this impact of 
college-wide factors is “shared” with the many 
causes that may be different for each in- 
dividual. 

Isolation of college-level effects. An addi- 
tional question which can be treated at the 


TABLE 3 


COLLEGE ADMINISTRATIVE CHARACTERISTICS CORRE- 
LATED WITH ĪNDIVDUAL FACULTY 
CRITERION RATINGS 


Individual faculty criterion ratings 
lle: inistrativ F: ~ ihe 
eg race Minimum | Satisfac- | Satisfac- 

condi- tion with | tion with 
tionsfor | jobasa dean of 
leaving whole college 
Influence 
By dean 05 00 
—.02 —.08 
Total influence 1 .03 
07 —.03 
Bases of dean’s influence 
Legitimate —.06 —.07 
—.05 —.06 
Expert .09* -10* 
05 05 
Referent .11* .11* 
.09* .08 
Reward = 07. =t" 
—.07 —.09* 
Coercive —.11* =i 
—.08 —.06 
Bases of faculty influence 
Legitimate 00% 
09% 
Expert 09% 
08 
Referent 02 
00 
Reward =ar 
0 
Coercive —.01 
—.03 


Note.—N = 685 faculty members. For each cell in the 
table two correlations are given: The upper value is the zero- 
order correlation, rab; the lower (boldfaced) value is the partial 
correlation, 74b.a; where, 


A = college-wide mean of ratings of administrative char- 
acteristics 

b = individual faculty criterion ratings (satisfaction, etc.) 

a= 1 faculty ratings of administrative character- 
istics. 


As a matter of convenience, all significance levels were cal- 
culated on the basis of 500 df. Actual response varied some- 
what from item to item, but no combination of items (ab) 
provided less than 500 df, 

* <.05, two-tailed, product-moment correlation. 

** p < .01, two-tailed, product-moment correlation, 
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individual level of analysis has to do with the 
way in which administrative characteristics 
have an influence on individual satisfactions: 
Is it simply the perception of these character- 
istics which affects satisfaction? In order to 
deal with this issue, each relationship be- 
tween an administrative characteristic and in- 
dividual faculty satisfaction is presented in 
two forms in Table 3: the zero-order correla- 
tion and a partial correlation. The partial cor- 
relation rules out the effects of each individ- 
ual’s own perception of the college administra- 
tive characteristic, thus removing that portion 
of the relationship which might be attributed 
to “phenomenological” or “halo” effects. The 
portion that remains thus represents a “phe- 
nomenology-free” measure of the college-level 
effect. For a discussion of the background 
and rationale for this form of analysis, see 
Tannenbaum and Bachman (1964); its im- 
plications for the present study are treated 
later in this paper. 


Influence 


The mean level of influence attributed 
to the dean is midway between “moderate” 
and “considerable.” As Table 1 indicates, the 
standard deviation among college mean 
ratings is relatively large; thus it appears that 
the 12 deans vary considerably in their in- 
fluence. The mean total influence rating is 
also midway between “moderate” and “con- 
siderable”; however, the variation from col- 
lege to college is much smaller (perhaps be- 
cause influence in this case is averaged across 
six levels from board of trustees to students). 

Colleges in which the total amount of in- 
fluence is high, and those in which the dean’s 
influence is high, show greater mean satisfac- 
tion with the dean. These administrative 
characteristics are also related to individual 
faculty members’ satisfaction with the dean; 
however, the correlations with dean’s influence 
disappear almost completely when the effects 
of individual perceptions are removed through 
partial correlations (Table 3). The more gen- 
eral measures of faculty satisfaction (mini- 
mum conditions for leaving, and satisfaction 
with the job as a whole) show little relation- 


ship to our influence measures at either level 
of analysis.* 


Bases of Influence 


When asked to rank reasons for doing what 
the dean suggests or wishes, faculty members 
gave the highest ranking to the measure of 
expert influence (“I respect his competence 
and judgment about things with which he is 
more experienced than I”); next most im- 
portant were legitimate and referent influence; 
lowest rated were reward and coercive in- 
fluence, that is, the use of sanctions (Table 1). 
Correlations with the criterion variables were 
positive for expert and referent influence, 
negative for legitimate, reward, and coercive 
influence.“ 

Turning next to faculty perceptions of 
why the dean follows faculty suggestions or 
requests, it is clear that the results are largely 
similar to those for the bases of the dean’s 
influence. The important exception is legiti- 
mate influence. Faculty members indicated 
that the dean’s most important reason for 
acting as they wish is that “The dean feels 
that his job properly includes an obligation 
to consider and act upon my suggestions 
as a faculty member.” Moreover, this legiti- 
mate influence of the faculty over the dean is 
positively related to satisfaction with the 
dean, The ordering of the four other bases 
of dean’s influence, and the actual mean rank- 
ings are quite similar (see Table 1). The 
correlations with satisfaction measures are 


4It is of interest to note that the respondents 
were also questioned about the ideal amount of 
influence their dean should have. The level they 
recommended was consistently higher than their 
rating of the dean’s actual influence. The increase 
occurred in 11 of the 12 colleges; in the twelfth 
case, the estimates of actual and ideal influence were 
practically identical, and very high. 

5 Some caution must be exercised in interpreting 
correlations with the present measure of bases of 
influence. The ranking method used in obtaining 
the data makes it impossible for all five bases of 
influence to be correlated in the same direction with 
any single criterion variable. Thus, it may be that 
positive correlations with expert and referent influ- 
ence “caused” negative correlations with other bases 
of influence, or vice versa. In spite of this limitation, 
the overall pattern of correlations with bases of in- 
fluence is of considerable interest. 
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VALIDITY OF THE GUILTY-KNOWLEDGE TECHNIQUE: 
THE EFFECTS OF MOTIVATION * 


P. O. DAVIDSON 


University of Calgary 


48 Ss divided into 12 groups took part in an experimental investigation of 
detection of a simulated crime. For each of 12 crimes, 3 Ss were highly 
motivated to commit the crime; 1 succeeded, 1 attempted but failed, and 1 
did not make an attempt. The 4th S interrogated for each crime was an 
innocent control. Motivation for deception (hope of reward) was high ($25- 
$50) for half of the crimes and low (<$1) for the other half. Detection of 
deception using a polygraph recording of GSR and the guilty-knowledge tech- 
nique resulted in correct classification of 92% of “guilty” Ss and 100% of 
“innocent” Ss. No significant differences were found due to level of motivation 
for deception. Results were discussed in terms of their relationship to similar 


studies and actual criminal investigations. 


The first recommendation arising out of 
Orlansky’s (1962) review on lie-detection 
capability was for further investigation of the 
validity of lie-detection techniques. This view 
has been reiterated recently in investigations 
by the Committee on Government Operations 
(1965, 1966) of the United States House of 
Representatives concerning the use of poly- 
graphs as “lie detectors” by the Federal 
Government. A common problem in studying 
the validity of lie detection, although not 
commonly recognized by many lie-detector 
operators, is that the polygraph merely in- 
dicates that a physiological response has oc- 
curred. The operator must infer the lie on 
the basis of the observed physiological re- 
sponse, In spite of some extravagant claims 
(cf. Lee, 1953), the inference that autonomic 
responses can indicate lying must be viewed 
with extreme skepticism since the physiolog- 
ical responses measured by polygraphs can be 
produced by a wide variety of stimuli quite 
incidental to any process of lying. Lykken 
(1959) has suggested that the detection of 
guilt by using physiological measurements in- 
volves a more reasonable inference than the 
detection of lying. If a guilty individual is 
presented with some relevant details about his 


1 The author gratefully acknowledges the assistance 
of Geri Gold and the technical advice of Arthur R. 
Roberts of the Calgary City Police Department in 
carrying out this study which was supported in 
part by Canadian NRC Grant No. APA 213. 


crime interspersed among similar but ir- 
relevant items, he would be expected to re- 
spond differently than an innocent individual 
who had no “guilty knowledge” about which 
items were relevant. The assumption here is 
not that the subject (S) will respond in a 
particular, recognizable way when he is lying, 
but merely that he will respond differently to 
relevant details of his crime than to irrelevant 
details. 

Lykken found that he could correctly 
identify, using a simple objective scoring 
procedure, all innocent Ss and 93.9% of his 
guilty Ss against a chance expectancy of 50%. 
He noted, however, that his Ss were not par- 
ticularly emotionally involved in their crime, 
nor were they strongly motivated to try and 
defeat the test. In a later study he found 
this guilty-knowledge technique highly re- 
sistant to faking (Lykken, 1960). The present 
study was designed to evaluate the validity 
of the guilty-knowledge technique under more 
ego-involving circumstances and varying 
amounts of motivation. 


METHOD 


The Ss used in this experiment were 48 volunteer 
college students assigned at random to 12 groups. 
Three Ss chosen at random from each group took 
part in the crime as “hunters”; the fourth S did not 
and he had no knowledge of the nature of the 
experiment. The “hunter” game was used for enacting 
each crime since it seemed to be a highly ego- 
involving game for university students. The three 
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hunters from each group were seen individually by 
the experimenter (E) and shown a photograph of a 
student who was to be their intended victim (the 
12 victims were all collaborating psychology stu- 
dents). Basic information concerning the name of 
the victim, his faculty, and his lecture schedule 
was supplied. Each hunter was told that the 
victim would be carrying a sealed envelope which 
the hunter was to take and examine if he could 
successfully hunt down the victim and enact his 
murder. The hunter was informed that the envelopes 
contained vouchers for prizes ranging in value up 
to $50 and that he would be permitted to cash the 
voucher if he was not detected as the criminal after 
a polygraph examination. The hunter would not 
know the value of the voucher unless he was suc- 
cessful in enacting the crime. Although the hunters 
were not aware to it, half of the 12 victims had 
envelopes with vouchers ranging in value from $25 
to $50, while the other half ranged in value from 
10¢ to $1. Each hunter was informed that there 
were other hunters after the same victim and that 
the first successful hunter would be the one to get 
the envelope. Each hunter was free to choose his 
own method, time, and place for enacting the crime; 
the only restrictions were that he must work alone 
and tell no one of his plans or success. As an addi- 
tional incentive to the hunters, prizes were awarded 
upon completion of the experiment for the most 
ingeniously devised crimes, whether the hunter was 
successful in getting the envelope or not. 

The hunters were told that in order to add some 
uncertainty to the success of the crime the victim 
had been given a random timetable with specified 
time intervals on it when any crime attempted would 
not be successful. If an attempt was made during 
one of these “safe periods” the crime was said 
to have failed, the envelope was not surrendered, 
and the hunter was barred from trying again for 
24 hr. The hunter would not know until he made an 
attempt whether it would succeed or not. In fact, no 
such timetable existed and the first hunter to make 
an attempt was always told that his victim was in 
a “safe period” and the attempt had failed. The 
second hunter to make an attempt was always judged 
successful and took the envelope. 

Each victim had a code number which was posted 
on a central campus bulletin. This number was to be 
checked before making any attempt and to be re- 
moved as soon as any hunter was successful in ob- 
taining the envelope by enacting the murder of his 
victim. Thus, if the number was gone, the other 
hunters would know the game was over and would 
not make any further attempts on the victim. 

In this way, for every victim there were four 
suspects: one who had committed the crime, one 
who had tried and failed, one who was motivated 
to attempt the crime but had no opportunity, and a 
fourth “suspect” who in fact knew nothing about 
the crime or the victim and was in no way in- 
volved or motivated. 

As soon as the second hunter had succeeded in 
obtaining the envelope the victim supplied E with 


answers to some questions relevant to the crime 
(eg, where it occurred, murder weapon used, etc.) 
Six multiple-choice questions about the crime and 
the content of the envelope were made up with 
five alternatives for cach question (cf. Lykken, 1959). 
The first alternate on every question was always a 
foil and not scored. This was done in order to 
reduce the possibility of incorrectly scoring orienting 
responses. The four suspects were then brought in 
for interrogation by another E who had no knowl- 
edge about who the criminal was, or which answers 
to the multiple-choice questions were correct, The 
S was seated in the interrogation room, GSR 
electrodes were attached to his dominant hand, and 
a pair of headphones adjusted to his ears. The E 
was located with a Grass Model 7 polygraph in an 
adjoining room and spoke to S via a microphone. 
Each S was told he was to be questioned in relation 
to a crime. He was to listen to each question con- 
sisting of several parts but to make no answer. 
The standardized list of six questions was then 
read to S and his GSR responses to each of the 
five alternates were recorded. Heart rate and respira- 
tion rate were monitored but not scored. 

Scoring of the GSR response after the recording 
was completed was simple and objective (Lykken, 
1959). Scoring was done without prior knowledge 
of the correct answers in order to prevent any 
criterion contamination. An S’s GSR’s to the several 
alternatives in a given question were ranked in order 
of amplitude. If his largest response was found to 
be a relevant alternative he was given a score of 
2 on that question. If his second largest response 
was to the relevant alternative he was given a score 
of 1. Thus a perfect innocent score was 0 and 
perfect guilty score was 12. 


RESULTS 


Following Lykken’s (1959) criteria, all 
scores of 6 or less were classified “innocent” 
and all scores over 6 “guilty.” All six of the 
guilty Ss who were highly motivated to decep- 
tion because of the high value of their vouch- 
ers were correctly classified. Five of the six 
guilty Ss who were under low motivation to 
deception (negligible value of vouchers) were 
correctly identified. The difference in classi- 
fication between these two groups was not sig- 
nificant (Fisher exact probability p= 50 
[Siegel, 19561). Combining the two groups, 
11 of the 12 guilty Ss were correctly classified 
as guilty (92%). All 36 of the innocent Ss 
(100% ) were correctly classified as not guilty. 
There was a total of 48 interrogations and 47 
of these were correctly classified—giving an 
overall 98% correct classification against a 
chance level of 25%. 
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Discussion 


There were several reasons for the fairly 
involved procedure used in this study. Pri- 
marily it was designed to simulate, a little 
closer, actual situations involving real crimes, 
but at the same time maintain rigorous ex- 
perimental control. By allowing S to choose 
his own time, place, and method, plus the 
incentive for ingenious crimes, S became much 
more actively and personally involved than if 
he merely walked through the motions of a 
staged crime. Some of the uncertainties in- 
volved in real crimes were introduced. Not 
every crime attempted is automatically suc- 
cessful and there is usually no way of knowing 
in advance. A criminal usually does not know 
exactly what his gains will be (if any) until 
he successfully completes the crime. Further- 
more, in order to keep the rewards of his 
crime he must be able to escape detection. 
The motivation for escaping detection may be 
a complex one involving some combination 
of fear of punishment and hope of reward. 
Most experimental studies have increased this 
motivation by threat of punishment (usually 
electric shock). In the present study, the 
hope of reward was experimentally varied to 
see if this had any effect on ability to escape 
detection in a polygraph examination. It has 
sometimes been suggested that successful 
detection rates in actual crimes would be sig- 
nificantly higher than in experimental studies 
since the tremendous motivation involved in 
the former would increase the amount of 
autonomic responsiveness to questions relevant 
to the crime. While the differences in this 
study are in this direction, they do not sup- 
port this hypothesis. They do not clearly 
refute it either. The small samples of guilty 
Ss and the limited rewards combined with 
no real threats of punishment in the present 
study require caution in interpreting these 
results. 

One observation did occur which merits 
further study. A common attitude in the lie- 
detection field (cf. Lee, 1953) is that the 
GSR is not useful because it is “too sensitive,” 
Lykken (1960) discounted this objection to 
the GSR on the basis of his experimental 
studies with essentially fairly low levels of 
motivation to deception. In the present study 


it was observed, particularly in guilty Ss who 
possessed $50 vouchers, that there seemed 
to be a great many spontaneous fluctuations 
in base-line GSR. Scoring GSR responses to 
relevant questions must be done very carefully 
in such cases to prevent the confusion of 
spontaneous base-line fluctuations from being 
counted as GSRs. While this was not too 
difficult in the present experiment, it does 
indicate caution in uncritical adoption of 
Lykken’s scoring criteria for criminal in- 
vestigations. Malmo (1957) has suggested 
that base-line fluctuations can be a useful 
index of drive level and, consequently, a very 
high drive level produced in a suspect for 
any reason (e.g., merely being suspected of 
a crime) would reduce the reliability of scor- 
ing GSR responses. In this sense the “extra- 
ordinary sensitivity of the GSR” may not be 
a “clear virtue” as Lykken (1960) suggests 
that it is. 

Some experimental studies stack the odds 
for correct detection very much in favor of 
the polygraph examiner by merely requiring 
him to separate one guilty suspect from 
several innocent controls. In criminal investi- 
gations the odds are not always so nice. 
Often there are several suspects who had good 
motives for committing the crime, some may 
have been planning it and some may have 
even attempted it unsuccessfully. But to the 
extent that these experimental results can be 
generalized, the guilty-knowledge technique 
seems to work as well in either situation. The 
92% correct classification of guilty Ss and 
100% correct classification of innocent Ss is 
essentially the same using the present pro- 
cedure as the 93.9% correct classification of 
guilty Ss and 100% correct classification of 
innocent Ss using Lykken’s (1959) procedure. 
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COMPARISON OF BEHAVIORAL STYLES BETWEEN 
ENTERING AND GRADUATING STUDENTS IN 
OFFICER CANDIDATE SCHOOL 


PETER B. PETERSEN axo GORDON L. LIPPITT 
George Washington University 


2 groups of students in an officer candidate school (N = 880) are compared to 
determine their differences in behavioral styles. The instrument used is the 
Job Analysis and Interest Measurement. There is a difference at the .01 
level of significance in the mean scores between entering (N = 561) and gradu- 
ating (N = 319) students in 4 prechosen behaviora] dimensions. Entering stu- 
dents compared with graduating students place a higher value on approval 
from others. In the 3 other scales considered, the mean graduating students’ 
scores are higher than the entering students’ scores. This indicates that gradu- 
ating students as compared with entering students tend to be more self- 
assertive, are more likely to be persuasive leaders, and are more prone to like 


supervisory activities. 


A worthwhile evaluation of training may 
be accomplished by evaluating groups of 
trainees before and after they are trained. 
Schein (1965) favors an evaluation of train- 
ing which includes an observation of trainees 
after they have returned to their jobs. A con- 
trol group may be helpful in determining if 
the changed behavior is actually caused by 
training. Prior studies tend to indicate that 
the change process may be conducted in a 
variety of ways. Differences in the change 

for the most part seem to center on 
the degree of influence exercised by the person 
being changed. Schein (1961) visualizes that 
“coercive persuasion” is employed in some 
management development programs. He con- 
siders that many managers being developed 
need to be “unfrozen” because they are 
psychologically unprepared to change certain 
attitudes in favor of untried new ones. Bennis 
(1964) suggests that an indoctrination form 
of change is being used by many schools. 
While the authors are interested in the change 
process, primary attention will focus on find- 
ing the specific differences between entering 
and graduating students, and then investigat- 
ing why these particular dimensions changed. 
The test instrument used in this study to 
compare these two groups of students is 
the Job Analysis and Interest Measurement 
(JAIM).* It has been used for studying 


1The authors are grateful to Regis H. Walther, 
author of the test instrument, for his personal as- 


more than 30 occupational groups including 
engineers, lawyéts, foreign service officers, 
ambassadors, judges, social workers, police- 
men, physicists, and secretaries. Walther 
(1964) found that this instrument can dis- 
tinguish among occupational groups and be- 
tween superior and weak performances within 
an occupational group. It is a 125-item self- 
report questionnaire used to measure the 
personal qualities of the worker (other than 
aptitudes, training, or knowledge), which have 
an influence on job success or failure. 

This study is a comparison of the average 
behavioral styles between a group of entering 
and graduating students in one specific army 
officer candidate school (OCS). Behavioral 
styles refer to the consistent ways in which 
a person organizes his physical, emotional, 
and energy resources. The specific army OCS 
used in this study, while anonymous, is 
representative of nine others. The duration 
of the training is 23 weeks. The mission of 
OCS is to develop combat leaders. This train- 
ing, according to a Department of the Army 
pamphlet (United States Army, 1966), is 
designed to place the student under physical, 
mental, and emotional stress to simulate the 
stress and fatigue of combat. Within this 
particular branch being studied, 77% of its 
newly commissioned officers will be provided 
by this training source, 22% will be provided 
sistance in accomplishing much of the work de- 
scribed in this paper. 


Benaviogat Sryres N Orc Caxnmarss 67 


of newly © 
this training source indicates a need for this, 
and similar, studies. 


Problem 


prepared to resign after they are in the course 
for only a short time, some for only a few 


hours, An initial thought was that students 4 passives c 
in this particular anair, while a angative | valor imdiosim A 
re eta TT A 


** 


complete OCS. A start in making such an 
idea fruitful might be a comparison of the 


behavioral styles between entering and grad- É 
uating students in OCS. This study will at- the student’s relationship with internal fac- 


tempt to make such a comparison, pertaining 

to four specific behavioral style scales. pry fy oe Se 2 
Groups of students (rather than individ- J 

uals) will be analyzed. A group of entering F 

students will be compared with a different pinch — ona age rows Ber s } 


group of graduating students. Behavioral ised 
styles will be analyzed in terms of the signifi- and influence by the student. 
cant difference between the mean behavioral pf Study 


styles of the two groups. The hypotheses will hypoth 
reflect directly the purpose of this research, In the development of the d 


which is to compare four this study an initial thought was ——— 


c 
1 d Perseverance 
7 ee Nene ANE TOEI M of Ferree 
There are a number of factors which, in ise ad Aes 2 or fih from the 
the opinion of the authors, influence the "A. pilot study was conducted in June 
behavioral styles of officer candidate students i ree i Zn the average behavioral styles 
so that they may become qualified to grad: ol se catering students were compared with 
uate. Of the many factors, four are considered a werage behavioral styles of 33 graduat- 
to be especially pertinent to this study. Their ing O as A summary of the results of this 
pertinence is judged on the basis of their iot study is shown in Table 1 
role in determining the culture that char- ae hypotheses of this study are stated 
acterizes OCS, the social ce of be. below: 1 
student within OCS, and the nship be į hesi tering 
tween the student’s social and 1 cared ath — ae 3 
i oat e 
experiences. These factors arè (1) a higher value on approval pipa others; 
; uad (4 therefore, entering students will score sig- 
struction, (3) the tactical officer, (4) nificantly higher on the JAIM scale Approval 


It is understood that many other factors From Others. ; ; 
may influence their behavioral styles. Some 2. Hypothesis IT: Coan speed 
of these other factors are: (1) the prior students, graduating students in 
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to be more self-assertive; therefore, graduat- 
ing students will score significantly higher on 
the JAIM scale Self-Assertiveness. 

3. Hypothesis III: Compared with entering 
students, graduating students in OCS are 
more likely to be persuasive leaders; there- 
fore, graduating students will score signifi- 
cantly higher on the JAIM scale Persuasive 
Leadership. 

4. Hypothesis IV: Compared with entering 
students, graduating students in OCS are 
more prone to like supervisory activities; 
therefore, graduating students will score sig- 
nificantly higher on the JAIM scale Super- 
visory Activities, 


MerHop 


This study attempts to compare, systematically, 
several variables in the behavioral styles between 
entering and graduating students in OCS. This 
difference is revealed by comparing the mean 
behavioral styles of a sample from each of these 
two populations. The hypotheses of this study are 
concerned with the differences in four specific be- 
havioral style scales represented in the test instru- 
ment used in this study. The instrument, called the 
Job Analysis and Interest Measurement (JAIM), 
provides measures for 34 scales relating to attitudes, 
interpersonal behavior, formal organizational be- 
havior, decision-making behavior, work preferences, 
values, and academic achievement, The difference 
in the specific scale means, if statistically sig- 
nificant (determined by using Fisher’s t), will form 
the basis for further evaluation, explanation, and 
possible implications and speculation. 

Entering students sampled were administered the 
JAIM during their first week of training in OCS, 
while graduating students were sampled in their 
twentieth to twenty-second week. Questionnaires 
were administered during the period from June 21, 
1966 through October 21, 1966. The conditions of 
the testing facilities were reasonably equal in all 
cases. The data preparation included transferring 
the student answers from each answer sheet to 
three data-processing cards. Starting October 7, 
1966, an administrative-data questionnaire was also 
completed by each person administered the JAIM, It 
is represented by a fourth data-processing card 
for each student. The analysis of the data was 
accomplished by an IBM 1620 computer. 2 

The method of evaluating the JAIM results in 
this study is to determine if there is a significant 3 
difference between the mean scores on a specific 
JAIM scale between entering students (N =561) 


Computer operations were conducted by David 
Reimer, staff associate in the Social Research Group 
of The George Washington University. 

The term “significant” is reserved for the mean- 
ing statistically significant. 


and graduating students (W = 519). The significance 
in the difference of means is tested by conducting a 
t test of the two means. In this study a positive 
£ value Indicates that graduating students are higher 
in a particular scale, while a negative ¢ value in- 
dicates that entering students are higher as per- 
tains to this particular scale, In the evaluation of 
the two experimental groups, significance at the 
01 confidence level is p< 01 with 1 2.88 for d 
= 878. 

The students in the population sampled were 
homogeneous as to their age, sex (male), state of 
health, and general educational level. Some Negro 
students were present, but the proportion of Negroes 
in this OCS was less than the proportion of Negroes 
in the Army enlisted ranks. The results of an ad- 
ministrative-data questionnaire completed by enter- 


TABLE 2 


Group AGGREGATES OF Inprvipvat CHARACTERISTICS 
FOR ENTERING (N = 151) AND GRADUATING 
(N = 152) STUDENTS 


Entering | Graduating 


students | students 
Method of entering service 
Volunteer 61% 72% 
Draft 39% 26% 
Other (i.e., National Guard 
Unit called to active duty) 0% 2% 


Students presently in class who have been turned back 


from another class 
Turned back for academic 
reasons 1% 1% 
Turned back for leadership 
reasons 1% 13% 
Turned back for other reasons 
(le., medical) 0% 4% 
Educational level 
High school or equivalent 18% 20% 
Some college but did not 
graduate 54% 47% 
Graduated from a junior 
college 5% 5% 
Graduated from college 16% 20% 
Some postgraduate work 6% 1% 
Master’s degree or higher 1% 1% 


— eee 
This may indicate that, for this study, difficulties in meeting 
i p proni standards have a higher i 


~~ 


fing and graduating students are shown in Table 2. 
This questionnaire was designed to help describe 
the sample. It reflects the aggregate of individual 
characteristics for a group containing 151 entering 
students and a group containing 152 graduating 
students. 

Since 13 entering classes and 10 graduating 
clases were sampled, the writers felt it advisable 
fo determine the variance among different classes. 
The statistical test selected for this comparison of 

was the analysis of variance, which provides 
a method of testing for significant differences be- 
tween means among and between groups. Each of 
the four JAIM scales was evaluated in terms of 
the stability of the entering classes’ set of 15 
means, and the stability of the graduating classes’ 
set of 10 means. 


RESULTS 


The results of the analysis-of-variance tests 
for entering and graduating classes are given 
in Table 3. Among the entering classes, one 
significant difference ($ < 05 with F = 1.77 
for dj = 12) was found on the Approval From 
Others scale. The fact that one out of eight 
comparisons shows a significant difference 
among groups indicates that different classes 
vary more than would be expected due to 
chance. This should be considered when 
interpreting the data. 

As shown in Table 4, the significance in 
the difference of means is tested by conduct- 
ing a ¢ test of the two means. In this analysis 
a positive ¢ value indicates that graduating 


TABLE 3 


ANALYSIS OF VARIANCE AMONG ENTERING AND 
GRADUATING CLASSES 


JAIM scale 


Approval from others 
Entering classes (N = 13)* 
Graduating classes (V = 10)? 


Supervisory activities 
Entering classes* 
Graduating classes” 


Entering classes (N = 13): 
<.05 with F = 1.77 for df = 125? <.01 with F =2.19 
OTe cdnsting classes (N = 10) 
juatiny lasses = t $ 
8 with F 1.91 for df =9; P <01 with F = 2.48 
or df = 9. 
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un n or Scour: tos tan Teo 
Bxreeneent as Geoore 


nee | P I 


JAIM scale 
Entering students 
higher (V = S61) 
f from others} % | O1 | Yes 
Participative leader- S43 | OF | Not 
ship hypothe 
| fmd 
Graduating students | i 
higher (N = 319): | hes yl 
Seif-assertivencss | sót | 01 | Yes 
Persuasive > | 4.15 | D | Ye 
28 91 | Ye 


indicates graduating students are higher 
in this particular scale, while a negative ( valor indicates t 


entering students J ie aime to this ae 
3 lod eop < 


xalar 
nn = L 
for dy = 874. 
+ Át the .01 bevel of significance, 


students are higher in à particular scale 
and a negative ¢ value indicates that entering 
students are higher as pertains to a particular 
scale. In the analysis of the two experimental 
groups, a significance of at least the 01 
confidence level occurs when t is greater 
than or equal to 2.58 for 878 degrees of 
freedom. 
Entering students compared with graduat- 
ing students tend to place a higher value on 
from others, as determined by the 
Job Analysis Interest Measurement (JAM). 
In the other three scales, the graduating 
students were higher, which indicated the 
following: 


1. Compared with entering students, grad- 
uating students in OCS tend to be more self- 
assertive, as determined by the JATIM. 

2. Compared with entering students, grad- 
uating students in OCS are more likely to be 
persuasive leaders, as determined by the 
JAIM. 

3. Compared with entering students, grad- 
uating students in OCS are more prone to like 
supervisory activities, as determined by the 
JAIM. 

There is a significant difference in the 
mean JATM scores between groups of entering 
and graduating students in the four prechosen 
comparisons established by the four hy- 
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potheses. Within the structure of the re- 
straints established by earlier assumptions, 
the four hypotheses of this study are affirmed 
and accepted 

The Approval From Others scale measures 
the degree to which the individual values 
himself by obtaining the approval of others. 
The results of this study indicate that the 
Approval From Others scale, located within 
the “Value” category of scales, has the high- 
est t value among any of the 34 JAIM scales 
for this particular research study (t value 
of —7.40). Graduating students appear to be 
mission oriented: some express their philos- 
ophy as “not overly concerned with what my 
buddies think; it’s the mission that counts,” 
In contrast, entering students tend to “worry 
too much about what their buddies think.” It 
seems that mission orientation is paramount 
during training, while the approval from 
others tends to be secondary. 

Studies in social psychology during World 
War II suggest that the personality of the 
officer candidate is changed to adapt to 
new standards. Stouffer, Suchman, DeVinney, 
Star, and Williams (1949) describe this 
change as follows: 


The hopeful candidate is now subjected to a nearly 
catastrophic experience, which breaks down to a 
large extent his previous personality organization. 
His previous valuations fail him, and in order to 
find a basis for self-respect, he must adopt new 
standards or escape from the field. His high mo- 
tivation to become an officer usually rules out the 
latter alternative. At the same time, new, ap- 
propriate attitudes are built up and established. 
The catastrophic experience provides a kind of 
purgatory, a definite demarcation from the candi- 
date’s enlisted incarnation that puts a barrier be- 
tween the new officer and his enlisted memories. 
It has some of the characteristics of a conversion 
experience, or the ordeal of a medieval knight. The 
effect of this ordeal on the officer candidate is not 
only to attack his previous personality, but to 
exert a positive influence in the desired direction. 


An interpretation that may be drawn is that 
students are being molded into the behavioral 
styles required to become successful junior 
officers. The graduating student is action 
oriented, with a greater concern for the mis- 
sion than for the welfare of his subordinates. 
However, this does not imply that he is not 
concerned with their welfare. It does imply 


that concern for subordinates is second only 
to the accomplishment of the mission, 


Suggestions for Further Research 


An evident weakness in this research is 
that the authors cannot state what portion 
of the difference in behavioral styles is caused 
by: 


1. Students actually changing their be- 
havioral styles during the time considered; 

2. Students with certain behavioral styles 
resigning from the course and subsequently 
not being sampled as graduating students; 

3. Students with certain behavioral styles 
failing from the course and subsequently not 
being sampled as graduating students. 


Additional research should devise a system 
so that individual comparisons may be made 
instead of comparing the mean of each group. 

A study conducted a year or two from now 
may provide answers as to the stability of 
the behavioral styles “learned” or changed“ 
while at OCS. To what extent, if any, the 
individual reverts toward his original be- 
havioral style offers attentive consideration. 
A most interesting determination in the opin- 
ion of the authors is: To what extent are the 
behavioral styles of students actually 
changed? 
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SOCIAL TMMATURITY, INTELLECTUAL ABILITY, AND 
ADJUSTIVE BEHAVIOR IN COLLEGE ' 


DAVID KIPNIS 
Temple Universiiy 


It was hypothesized that students clawited as socially immature. in comparhon 
to less socially immature students, would be more visible to their instructors 
because of a higher verbal output in class, would have a higher abeentee rate, 
and would underachieve in academic performance, The Ss were make Temple 


were significantly more 
behavior. That is, as 


as contributing 


were absent from class (p 
Scholastic Aptitude Test 


psychology course. 


In the last several years there has been 
increased interest in describing adult per- 
sonality structure in terms of stages that have 
their origins in the person’s psychosocial de- 
velopment. For example, Loevinger (1966) 
has described at least seven stages or mile- 
stones in the development of the ego; Erikson 
(1959) has formulated this developmental 
sequence in terms of stages of ego identity; 
Sullivan, Grant, and Grant (1957) in terms 
of maturity levels; and more recently Maslow 
(1964) has described individuals in terms of 
levels of synergy. Despite the diversity of 
terms and stages pro , there appears to 
about the behaviors 
of persons classified as at relatively early 
stages of personality development, that is, 
socially immature 


fied as socially immature 
socially defined achievement roles such as are 
associated with high school (Kipnis, 1965a; 
Boessel, 1954), college (Gough, 1965; Heil- 
brun, 1965), or job (Kipnis, 1965b). The 


1 This research was aided by 
trustees of Temple University. 


a grant from the 


University students enrolled in 25 sections of 
course. Classification of social immaturity was 
support of the Ist hypothesis it was found that socially immature students 
visible to their instructors by virtue of their classroom 
freshman they were evaluated as more disruptive of 
classroom proceedings (p< OS), and as 
more to classroom discussion 
2nd hypothesis it was found that significantly more socially immature students 

< 01). Finally, among students in the top tertile of 
Scores, socially immature students underachieved in 
an introductory mathematics course (p < 01), but not in an introductory 


an introductory prychology 
by paper-and-pencil test, In 


upperclasmen they were evaluated 
(p < 05), In support of the 


evidence also suggests that the relation be- 
tween social immaturity and underachieve- 
ment is strongest more intelligent 
persons. For „ Roessel (1954) found 
that among more intelligent students those 
with high Pd scores on the MMPI dropped 
out of high school more frequently than those 
with low Pd scores, but this relation was not 
present among less intelligent students. Gough 
(1965) has consistently found among intel- 
lectually gifted persons that the Socialization 
scale of the California Psychological In- 
ventory is related to school achievement 
(poorly socialized students underachieved), 
and Heilbrun (1965) has reported a measure 
of conformity to social values as predictive 
of college dropouts among the more intel- 
ligent, but not the less intelligent. 

While the findings are consistent on the 
relation between achievement and social im- 
maturity, there is less information available 
on the nature of the day-to-day activities of 
socially immature individuals that eventually 
result in their underachievement. A start 
toward providing this information was made 
by the present writer in a taxonomic study 
of the behaviors and attitudes that dis- 
tinguish persons classified as socially im- 
mature (Kipnis, 1966). Among the findings 
was that socially immature persons did not 
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react as frequently as less socially immature 
persons to embarrassing, shameful, or anxiety- 
provoking situations. Presumably, this lowered 
level of reactivity would result in the socially 
immature person not experiencing anxiety 
over failure and consequently not being mo- 
tivated to reduce anxiety by striving for 
achievement. It was also found that socially 
immature individuals were less accepting 
of conventional values relating to obeying 
rules and regulations? This lack of concern 
over regulations might further provoke failure 
through a disregard of institutional rules. The 
foregoing findings however were based upon 
the intercorrelation of several questionnaires 
measuring strength of belief in conventional 
social norms, reactivity to anxiety, and level 
of social immaturity. There remains the ques- 
tion of whether these differences in attitudes 
and beliefs are also manifested behaviorally. 
The present study reports evidence on this 
point. 

Three studies were done. In the first the 
prediction was tested that socially immature 
students would participate more in classroom 
discussions and be more visible to their in- 
structors by virtue of their classroom be- 
havior. This prediction was based upon a 
prior finding that socially immature students 
reported that speaking in public was not 
likely to make them anxious. Accordingly it 
was expected that they would talk more in 
class. The second prediction was that socially 
immature students would be absent more 
frequently from class. This prediction was 
based upon the assumption that the behavior 
of socially immature students would be less 
constrained by general university regulations 
forbidding the cutting of classes. The final 
prediction was that among more intelligent 
students, the socially immature student in 
comparison to the less socially immature stu- 
dent should earn lower grade-point averages 
(GPAs). This prediction was based upon 
the previously cited literature on the relation 
between academic achievement and social 
immaturity among the more intellectually able 
population. 


? Socially immature respondents oriented their 
values around the acquisition of material things, 
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PROCEDURE 


Measure oj Social Immaturity 


An index of social immaturity was obtained by 
adding the unweighted scores from two purported 
measures of social development—the Socialization 
scale from the CPI and the Insolence scale (de- 
velopment of this latter scale is given below). The 
use of this combined score was based upon the goal 
of improving the measure of immaturity. A sub- 
sequent section of this paper will report on the 
relative effectiveness of the combined scores versus 
the scores taken singly. 

In a sample of 573 male Temple University 
students, the Insolence scale correlated 45 with the 
Socialization scale (scoring key reversed so that 
high scores indicated poorly socialized), suggesting 
that both scores were measuring aspects of the 
same personality organization. Both scales con- 
tributed relatively equal weight to the combined 
social immaturity index, as indicated by the fact 
that the means and standard deviations of both 
scales were almost exactly equal (Socialization 
scale: M = 16.70, SD =5.71; Insolence scale: M = 
1731, SD = 553. 

In the present research, persons with social im- 
maturity index scores in the top third of the 
Temple University distribution (39 or more) were 
designated as high in social immaturity (high), 
persons with middle third scores as middle in social 
immaturity (middle), and persons with bottom 
third scores of 29 or less were designated as low 
in social immaturity (low). It should be noted that 
students classified as low in social immaturity could 
probably be further subdivided into several higher 
stages of development if additional cognitive and 
attitudinal information were available. In general 
the scales used appear most sensitive in distinguish- 
ing persons fixated at earlier stages of development 
from persons at all hypothesized latter stages. 

Insolence scale, This is a 41-item paper-and-pencil 
biographical-type inventory. If all items were an- 
swered in the keyed direction, it would convey the 
impression of a physically active, aggressive, pleasure- 
seeking personality whose childhood was character- 
ized by early independence from family and con- 
flicts with grade-school authorities, The scale is 
based upon 27 of the 58 items in Torrance and 
Ziller’s (1957) Risk Scale to which have been added 
14 additional items describing aspects of childhood 
behavior. All items were selected on the basis of 
their relation to school and job performance in 
the United States Navy. 

The usefulness of the Insolence scale as a measure 
of social immaturity has been explored in a series 
of studies. In several laboratory studies, persons 
classified as high in social immaturity on the basis 
of Insolence scale scores were found to be non- 
compliant and resistant to an experimenter's in- 
fluence (Kipnis & Wagner, 1965), and to be non- 
productive when working for a leader who was low 
in status and power (Kipnis & Wagner, 1967), 
and to be easily aroused to aggressive behavior 
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(Kipnis, 1966). Among more intelligent patients 
admitted to a psychiatric ward in a Navy hospital, 
the scale had a biserial correlation of approximately 
60 with psychiatrists’ diagnoses of character and 
personality disorder (Kipnis, 1965b). 

In the population at large, the Insolence scale 
has been found to correlate 41 with the Extrover- 
sion scale of the Maudsley Personality Inventory, 
—41 with the Achiever Personality scale of Fricke's 
Opinion, Attitude and Interest Survey, 52 with the 
Sensation Seeking Scale of Zuckerman et al. (1964), 
32 with a measure of lack of emotional responsive- 
ness to shame-provoking situations developed by 
Lykken (1957), and .29 and .25 with the Pd and 
Ma scales of the MMPI, respectively. The scale 
does not correlate with age or general intelligence 
as estimated by both the Navy's General Classifica- 
tion Test and the Scholastic Aptitude Test. 

Socialization scale. This is a 51-item true-false 
attitude scale from the CPI. Gough (1965) has 
described the scale as identifying individuals along 
a continuum of asocial to social behavior and as 
able to forecast the likelihood that any person will 
violate the rules and norms of his culture. Re- 
sults of a wide variety of studies, summarized in 
Gough (1965), indicate that persons described as 
asocial by the scale do indeed exhibit such be- 
haviors as delinquency, underachievement in school, 
violations of paroles, and show a lack of responsive- 
ness to social reinforcement. 


Subjects 


The Ss consisted of male Temple University 
students enrolled in 23 sections of an introductory 
psychology course, taught by 13 different instructors. 
Class size averaged 42 students with approximately 
half of each class being male students. The specific 
number of Ss used varied from study to study, and 
the exact numbers used will be given when the 
results from each study are presented. Approxi- 
mately 49% of the Ss were freshmen and 51% 
were upperclassmen (mainly sophomores). The Ss 
were tested with the Socialization scale and the 
Insolence scale during the first week of the term. 


Intellectual Ability 


Intellectual ability was estimated on the basis of 
the combined verbal and numerical scores from 
the Scholastic Aptitude Test (SAT). To detect any 
possible moderating effects of intelligence, Ss were 
divided into thirds on the basis of their combined 
SAT scores, or at the median of the combined SAT 
distribution. The number of Ss available for each 
study dictated the decision as to whether Ss were 
divided into two or three equal groups. 


2 Three items were eliminated as inappropriate 
for the present population. Permission to use the 
scale for research purposes was granted by Consulting 
Psychologists Press, Inc., publishers of the CPI. 


Classroom Visibility 


Nine instructors teaching 13 sections of intro- 
ductory psychology were asked to rate all male 
students in terms of their classroom participation.* 
There were between 20 and 28 male students in 
each section, The evaluations were made in the 
ninth and tenth weeks of the school term. 

The evaluation forms consisted of three scales 
Each scale contained the alphabetical listing of all 
male students in a section, and the instructor was 
asked to circle the names of the students on each 
list that were like the description given at the top 
of the scale, The scales read as follows: 

Classroom visibility. “Some students attract the 
attention of their instructors because of their class- 
room behavior. These students may attract notice 
because they consistently ask questions or make 
comments about the course material. (Consider in 
this scale the frequency of classroom participation 
rather than the quality of the participation.) Other 
students attract attention because they talk to 
classmates or they do not appear to be attending 
to the lecture, or they exhibit other forms of 
inappropriate classroom behavior. Place a check 
alongside the names of students whose classroom 
behavior caused you to notice them.” 

Facilitative behavior. “Some students contribute 
to the teaching process by asking reasonable ques- 
tions or by making constructive comments. Place a 
check next to the names of the students who most 
frequently contribute to discussion in your class.” 

Disruptive behavior. “Some students disrupt 
classroom proceedings by asking inappropriate or 
hostile questions, by obvious inattention, or by 
other kinds of disruptive acts. Place a check 
alongside the names of students who best fit the 
above description.” 


Analysis 


The ratings on each scale were analyzed by means 
of a 3X 2 X 2 factorial design, with three levels of 
social immaturity, two levels of intellectual ability 
(median split on SAT scores), and two levels of 
students’ class (freshman versus all upper classes) 
as the independent variables. The classification of 
students in terms of class level was added to test 
the generally held belief that freshmen participate 
less in classroom discussions than do upperclass- 
men, It may be noted that the instructors had no 
systematic knowledge of students’ SAT scores, class 
level, or level of immaturity scores. 

For each S, a score was obtained by assigning a 
weight of 1 if the instructor checked his name on 
a given scale, and a score of 0 if his name was not 
checked. These scores were used as the dependent 
measure in the analysis of variance. A total of 285 


4Four instructors did not participate: two be- 
cause they knew the immaturity scores of their 
students, one because he did not notice any students 


by name, and one instructor because he did not 
complete the ratings. 
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TABLE 1 


SUMMARY OF THE ANALYSIS OF VARIANCE: 
TEACHERS’ RATINGS or CONSTRUCTIVE 
CLAssROOM BEHAVIOR 


Source df MS F 
Immaturity (I) 2 .1732 1.47 
Class level (CL) 1 1.0890 9.24 
Intellectual ability (A) 1 6792 5.76* 
IXCL 2 4743 | 4.02* 
IXA 2 bs 28522]. 4:82 
CLXA 1 5116 4.34* 
IXCLXA 2 1857 1.57 
Error 273 1179 

=p Sl. 


Ss were evaluated. Overall, 23% of the students 
were nominated as visible to the instructor, 14% 
were nominated as contributing constructively to 
classroom discussions, and 9% were nominated as 
disruptive of classroom proceedings. Two percent 
of the Ss were evaluated as both contributing 
to classroom discussion and as being disruptive 
of classroom procedures. The number of Ss in each 
of the nine cells of the design ranged from 17 to 
36; therefore the analysis of variance was based 
upon an unweighted means solution. 


RESULTS 


The analysis of the instructors’ first 
ratings on classroom visibility yielded no 
significant relations between visibility and 
social immaturity or class level. However, 
there was a significant main effect for in- 
tellectual ability—F (1/273) = 5.74, p < 05. 
More intelligent students were rated by their 
instructors as being more noticeable by virtue 
of their classroom behavior than were the 
less intelligent students. An average of 30% 
of Ss with above-median SAT scores and 
17% of Ss with below-median SAT scores 
were rated by their instructors as visible. 

The second rating scale requested instruc- 
tors to nominate students whose classroom 
behavior was constructive and contributed to 
the teaching process. The results of the 
analysis of variance of these ratings of con- 
structive behavior are summarized in Table 1, 
Table 2 shows the percentage of students 
nominated by their instructors. It can be seen 
in Table 1 that both class level and intel- 
lectual ability were related to ratings of 
constructive behavior, at the .01 and .05 
levels, respectively. However, there was also 


a significant interaction between these two 
variables. Inspection of Table 2 reveals the 
basis for this interaction. In general, upper- 
classmen who were above the median in 
intellectual ability were nominated more 
frequently as contributing to classroom dis- 
cussions than were below-the-median upper- 
classmen or freshmen at all levels of intel- 
ligence. The last column in Table 2 sum- 
marizes these data. 

Table 1 also shows a significant interaction 
between social immaturity and class level 
( < .05). Primarily, this significant effect 
was based upon the fact that among upper- 
classmen, high immature Ss were evaluate 
as contributing significantly more to class- 
room discussion than were middles or lows. 
Inspection of Table 2 shows that this effect 
was limited to more intelligent upperclass- 
men. Within this more intelligent upperclass 
grouping, 53% of the highs as compared to 
19% of the middles and 22% of the lows 
were evaluated by their instructors as con- 
tributing to the classroom discussion. This 
latter finding is consistent with Hypothesis 1, 
which predicted that high immature students 
would be more visible to their instructors. 
However, it appears that, in terms of con- 
structive behavior, the hypothesis holds only 
among more intelligent upperclassmen, 

The last rating scale asked instructors to 
nominate those students in their class whose 
behavior was disruptive of classroom proceed- 
ings. Table 3 summarizes the analysis of vari- 
ance of these ratings, and Table 4 shows the 
percentage of students, classified in terms of 
the independent variables, who were nomi- 
nated by instructors as being disruptive., 


TABLE 2 


PERCENTAGE oF SUBJECTS EVALUATED AS CON- 
TRIBUTING TO CTLass DISCUSSION 


Level of social 


Class Intellectual e 


age 
Tevel ability ANETAR 


High | Middle| Low 


Freshmen | Above median 30 6% 16% 10% 
elow median | 3% 15% 8% 9% 

Upper Above median} 53% 19% | 22% 30% 
classes | Below median 16% 5% 17% 13% 


Note.—N = 285, 
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TABLE 3 


ANALYSIS OF VARIANCE: INSTRUCTORS’ RATINGS 
or STUDENTS’ Disruptive BEHAVIOR 


Source df MS F 
Immaturity (I) | 2 2479 3.17˙ 
Class level (CL) 1 0023 <i 
Intellectual ability (A) 1 1088 1.39 
IX CL 2 .2649 3.39* 
IXA 2 .0725 <1 
CLXA 1 0906 1.16 
CL XA 2 0804 1.03 
Error 273 0781 


Social immaturity was significantly related 
to ratings of disruptive behavior (% < 05). 
However it can be seen in Table 3 that this 
relationship was modified by class level, as 
indicated by the significant interaction be- 
tween the two variables. Among freshmen, 
high immature Ss were nominated more fre- 
quently than middle immature or low im- 
mature Ss as being disruptive. Among upper- 
classmen on the other hand there was no con- 
sistent relation between levels of immaturity 
and disruptive behavior. The findings are 
again consistent with the general prediction 
that high immature Ss would be more visible 
to their instructors than middle or low im- 
mature Ss. However, in terms of disruptive 
behavior, the findings indicate that this pre- 
diction only holds among freshmen. 


Absenteeism 


During the twelfth week of the 16-wk. 
semester, attendance was taken in all classes. 
Any person absent on the day attendance was 
taken was assigned a score of 1 and those 
present a score of 0. In all, 12 instructors 
took attendance among 459 students; 21% 
of the students were absent. 

Seven instructors habitually took attend- 
ance during the semester, and these instruc- 
tors had 212 students in 11 class sections. 
The remaining five instructors had never 
taken attendance in their classes prior to 
this time. These instructors had 247 students 
in 10 class sections. It should be noted that 
three of the five instructors who did not 
take attendance usually took attendance dur- 
ing other semesters, but refrained this semes- 


ter at the request of the present writer. The 
request not to take attendance arose from 
the author’s interest in testing the effects 
of environmental restraints (the taking of 
attendance in this case) upon the conforming 
behavior of high immature Ss. A prior study 
had suggested that high immature Ss tended 
to conform to social norms when the environ- 
ment contained restraints against noncon- 
formity (Kipnis & Wagner, 1967). 
Attendance was analyzed by means of a 
3 x 2X 2X analysis of variance with three 
levels of social immaturity, two levels of 
restraint (instructors did or did not take 
attendance), two levels of class (freshman 
versus upper classes), and two levels of in- 
tellectual ability (above and below the median 
SAT scores) constituting the independent 
variables. There was an average of 18 Ss 
in each of the 24 cells resulting from the 
four-way classification. The cell with the 
smallest entry had 13 Ss and the cell with 
the largest entry had 30 Ss. The dependent 
measure was a score of 1 or 0, depending 
upon the absence or presence of the student. 


RESULTS 


Table 5 presents the summary of the 
analysis of variance of the attendance data. 
Three of the independent variables showed 
significant association with attendance; they 
were social immaturity ($ < 01), class level 
(p < 05), and restraint ($ < .001). 

Table 6 gives the average percentage of 
absent Ss, classified in terms of social im- 
maturity, class level, and restraint. For ease 
of presentation, classification of Ss in terms 


TABLE 4 


PERCENTAGE OF STUDENTS NOMINATED AS 
DISRUPTIVE BY THEIR INSTRUCTORS 


Level of social immaturity 


Class Intellectual 
level ability gN 
High | Middle | Low 
Freshman | Above median 21% 6% 0% 
Below median 17% 4% 5% 
Upper Above median] 6% 21% 6% 
classes | Below median| 8% 5% 0% 


Note. V = 285. 


76 Davm Krenis 


TABLE 5 


SUMMARY OF THE ANALYSIS OF VARIANCE: 
STUDENT ABSENCES 


Source df MS F 
Immaturity (I) 2 .859 | 5.34** 
Class level (CL) 1 .901 5.60* 
Ability (A) 1 .033 <1 
Restraint (R) | 1 2.166 13.45% 
IX CL 2 008 1 
IXA 2 os | <i 
IXR 2 024 | < 
CLXA 1 981 
C. XR 1 82 <1 
AXR 1 197 1.22 
IX CL XA 2 007 <1 
IXCLXR 2 146 <i 
IXAXR 2 004 <1 
CLXAXR 1 065 <1 
IXCLXAXR 2 005 <i 
Error 435 161 

*p <.05 
* <.01 
* <.001 


of intellectual ability has been omitted, since 
this variable was not related to absenteeism, 
either by itself or in interaction with the 
other three independent variables. 

Whether or not instructors took attendance 
yielded the strongest relation with absentee 
rate. When the instructors exercised their 
restraining powers by taking attendance, an 
average of 13% of the students were absent 
as compared to 27% when these powers 
were not used. Further, freshmen had a higher 
absentee rate (25%) than upperclassmen 
(17%). Finally, as predicted, an average of 
29% of the high immature Ss, 18% of the 
middle immature Ss, and 14% of the low 
immature Ss was absent. 

It appears then that, without knowledge 
of actual illness which presumably accounted 
for at least some of the absentees, such 
factors as the student’s personality, his years 
of experience as a college student, and the 
use of restraining power by the instructor 
were significantly related to the presence of 
a student in class. In the extreme cells we 
have 42% of the high immature freshmen 
who were in classes where attendance was 


not taken absent, as compared to 2% of the 
low immature upperclassmen who were in 
classes where attendance was taken. 


Academic Achievement 


Final grades in introductory psychology 
and in an introductory mathematics course 
were used as the primary criteria of academic 
achievement. The analysis was limited to 
216 freshmen completing their second term 
at Temple University. This restriction was 
used in order to control the year in which 
the Ss had taken mathematics. The necessity 
for this control was indicated by a preliminary 
analysis which found that upperclassmen had 
significantly higher grades in a given course 
than did freshmen in the same course, even 
after controlling for intellectual ability. 

Grades were analyzed by means of a 3 x 3 
factorial design, with three levels of social 
immaturity and three levels of intellectual 
ability, as measured by SAT scores, con- 
stituting the independent variables. Final 
grades in both courses, expressed on a 
0 (=F) to 4 (=A) scale, were entered as 
the dependent measures. 

Table 7 presents the results of the analysis 
of variance of mathematics and psychology 
grades. There was no significant relation 
between level of immaturity and the final 
grade in psychology. There was, however, a 
significant relation between final mathematics 
grades and level of immaturity (p < .05). 

Table 8 shows the average mathematics 
grades for Ss classified in terms of immaturity 
level and intellectual ability. High immature 
Ss earned lower mathematics grades than 
middle or low immature Ss. While the inter- 


TABLE 6 
PERCENTAGE OF STUDENTS ABSENT 

Level of immaturity 

Class Restraint 

level 

High | Middle) Low 
Freshman | Attendance taken 24% | 14% | 11% 
Attendance not taken] 42% | 37% | 21% 
Upper Attendance taken 22% | 6% | 2% 
classes | Attendance not taken| 27% 16% | 20% 


Note.—N = 458, 


| 


=f 


ApyusTIVE BEHAVIOR IN COLLEGE 77 


TABLE 7 


SUMMARY OF ANALYSIS OF VARIANCE: MATHEMATICS AND INTRODUCTORY PsycuoLooy 
GRADES IN RELATION TO INTELLECTUAL ABILITY AND SOCIAL IMMATURITY 
(UPPER FRESHMEN ONLY) 


Mathemati n p 
5 ematics Psychology 
df MS F df MS PF 

Immaturity (I) 2 3.77 3.12⁷ 2 1.35 1.32 
Intellectual ability (A) 2 1.46 1.20 2 12.15 11.89“ 
IXA 4 2.15 1.77 4 08 <1 
Error 207 1.21 207 1,02 

: es Eee — = 

45 S401 


action between immaturity level and ability 
was not statistically significant, the trend 
appears to be that, as intellectual ability 
increased, the relation between immaturity 
and mathematics grades also increased. Since 
these findings were consistent with prior re- 
search on the moderating effects of ability, 
it was decided to compute separate F tests 
between mathematics grades and level of im- 
maturity within each ability level. These 
separate tests found that the relation be- 
tween level of immaturity and mathematics 
was only significant among Ss who were in 
the top third of intellectual ability (p < 01). 
The differences in grades between high, 
middle, and low immature Ss were not sig- 
nificant among those students in the middle 
third or bottom third of intellectual ability. 

In addition to grades in mathematics, a 
tally was made of the number of freshmen 
either on probation or who had dropped out 
of Temple University at the end of their 
freshman year. Those freshmen on probation 


TABLE 8 


AVERAGE MATHEMATICS GRADES OF SUBJECTS 
CLASSIFIED BY LEVEL OF TMMATURITY AND 
INTELLECTUAL ABILITY 


Thellectoat Level of immaturity 
ability 
High Middle Low 
aS 
High third 146 1.80 2.55 
Middle third 1.93 2.05 2.27 
Low third 1.77 1.93 1.74 


Note.—N = 216. 


(cumulative GPA of less than 1.60) or who 
had dropped from Temple University were 
given a score of 1 and those freshmen not on 
probation were given a score of 0. These 
data were subjected to a two-way analysis 
of variance and the results of this analysis 
are summarized in Table 9. An additional 
eight freshmen who were excluded from the 
previous grade analysis because of incom- 
plete mathematics or psychology grades were 
included in the present analysis. 

The analysis of variance indicated that 
level of immaturity and intellectual ability 
were both related to unsatisfactory academic 
performance at the 05 and .01 levels, re- 
spectively. Table 10 shows the proportion 
of Ss classified by immaturity and ability 
level who were on probation or dropped at 
the end of their freshman year. It can be 
observed that at all levels of ability a greater 
proportion of high immature Ss than of 
middle immature or low immature Ss was on 
probation or had dropped out of Temple Uni- 
versity. Once again, to test for possible 


TABLE 9 


SUMMARY OF ANALYSIS OF VARIANCE: SUBJECTS 
ON PROBATION OR DROPPED FROM TEMPLE 
University (UPPER FRESHMEN Onty) 


Source df MS F 

EE 
Immaturity (I) 2 7509 4.03* 
Intellectual ability (A) 2 1.586 8.43** 
IXA 4 198 1.05 
Error 215 188 

*p <.05 

or 
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TABLE 10 
PERCENTAGE OF STUDENTS ON PROBATION 
on Drorreo 
Level of immaturity 
Intellectual . 
ability PR Tr š 
| High Middle Low 
High third | 28% 70% 4% 
Middle third B% | w% 4% 
Low third | 30% 38% 37% 


moderating effects of intelligence, separate F 
tests relating immaturity level to academic 
performance were computed within each abil- 
ity level. The results indicated that only 
among Ss in the middle third of intellectual 
ability was the relation between immaturity 
and grades significant (p < .05). 

In summary, the findings on academic per- 
formance suggest that high immature Ss tend 
to do as well as middle immature or low 
immature Ss in an introductory psychology 
course, but do considerably poorer than 
middle immature or low immature Ss in 
mathematics courses. This latter finding holds 
mainly among Ss who were in the top third 
of intellectual ability. In addition, there was 
a significant trend for more high immature 
Ss than middle immature or low immature 
Ss to be on probation or to have dropped 
from Temple University by the end of their 
freshman year. This trend was most marked 
among Ss in the middle third of intellectual 
ability. 


Comparison of the Validity of the Socializa- 
tion Scale and the Insolence Scale 


As a general statement the combined 
Socialization and Insolence scales yielded 
slightly stronger relations with the dependent 
variables studied here than did either scale 
by itself. Considering each dependent measure 
in turn, neither scale was related to the 
instructors’ first ratings of general visibility. 
The Socialization scale was significantly 
related to evaluations of facilitative class- 
room behavior (p < .05), while the Insolence 
scale was not. The Insolence scale was signifi- 
cantly related to evaluations of disruptive be- 
havior ( < .05), while the Socialization scale 
was not significantly related to this evalua- 
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tion. In terms of relationships with absen- 
teeism, the Insolence scale was significantly 
related to absenteeism ( < O5) and the 
Socialization scale just missed statistical sig- 
nificance (p < .08). Both scales were equally 
related to mathematics grades. 


Discussion 


Tt was originally proposed that high im- 
mature students would be more visible in 
class than middle immature or low immature 
students because they would experience less 
anxiety over speaking in public and hence 
should speak more. It was found that as 
freshmen, high immature students were more 
visible to their instructors because of their 
disruptive behavior. As upperclassmen, high 
immature students were more visible to their 
instructors because of their facilitative be- 
haviors. While these findings are consistent 
with the hypothesis of a higher verbal output 
for high immature students, the changes in 
evaluation of this output by instructors from 
freshmen to upper-class years also suggests 
that some form of social learning may be 
occurring during this period; that is, high 
immature students may be learning to modify 
their verbal output so that what they say 
is more acceptable to their instructors. An- 
other obvious possibility, of course, is that 
those high immature students who were 
rated as disruptive in their freshman year 
were dropped from Temple University and 
different high immature students were evalu- 
ated as facilitative in subsequent years. Un- 
fortunately, there are no data on the class- 
room behavior of the same students over a 
period of several years. Such information 
would be necessary to clarify the basis for 
the shift in evaluations (i.e., a longitudinal 
study). 

The second expectation of the present 
study was that high immature students would 
be less conforming to general university 
regulations against “cutting” classes than 
would middle immature or low immature 
students. This prediction was based upon 
previous findings that high immature stu- 
dents had lower internalized restraints against 
violating rules and regulations than did 
middle immature or low immature students. 
Lacking such internalized restraints, it was 


expected that high immature students would 
‘be more easily instigated by friends or their 
own impulses to cut“ classes. The findings 
supported the hypothesis in that significantly 
more high immature than middle or low im- 
mature students were absent from class. It 
Was also found that when instructors en- 
forced university regulations against absentee- 
"ism by taking attendance, there was a sig- 
nificant reduction in the number of all stu- 
mts who were nt. This latter finding 
be interpreted as indicating that if the 
‘environment is perceived to contain punitive 
restraints against nonconformity, acts of non- 
conformity will decrease. This interpretation 
is consistent with more general studies of 
| coercive power (French, Morrison, & Levin- 
ger, 1960; French & Raven, 1959; Zipf, 
1960) in which it has been found that reliance 
by authority figures upon threats of punish- 
ment induced higher rates of conformity than 
when threats were not relied upon. In essence, 
the taking of attendance by instructors rep- 
resents an exercise of threatening power, 
_ Since university regulations proscribe punish- 
ments for excessive numbers of unexcused 
absences. And to the extent attendance is 
taken, the probabilities are higher that stu- 
dents who are continually absent will be 
reported to the administration. 
* As a final point it should be noted that 
we findings on absenteeism among high im- 
mature students are not consistent with a 
hypothesis of Heilbrun’s (1965) that regi- 
mentation imposed upon students by uni- 
versities contributed to the academic failures 
© of bright, poorly socialized students. Heil- 
| brun, of course, was referring to a general 
S 


— 


system of social and academic regimenta- 
tion imposed upon students. Nevertheless, to 
the extent that taking attendance imposes 
restraints upon students, from Heilbrun’s 
hypothesis one would expect increased resist- 
ance and increased nonconformity among 
bright, poorly socialized students in classes 
where attendance was taken. Actually the 
Present study found a rise in conforming 
behavior of presumably an adjustive kind 
among poorly socialized students when re- 
t straints were imposed. While there is no long- 
term evidence on whether this kind of re- 
straint eventually frustrated bright, poorly 
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socialized students and caused them to drop 
out of school, an analysis of a subsample 
of second-term, bright (upper-third SAT 
scores) freshmen's final psychology grades 
yielded no differences in grades of high im- 
mature students in classes that did or did 
not take attendance. 

Turning next to academic performance, the 
most interesting finding was that among 
students with high third scores in intellectual 
ability, high immature students did as well 
as middle immature or low immature students 
in a course in introductory psychology, but 
not as well in a course in introductory mathe- 
matics. One likely explanation for these dif- 
ferences in achievement appears related to 
the task demands imposed upon students by 
the mathematics and psychology courses. The 
mathematics course requires constant class 
attendance and home study, including the 
solving of assigned problems, in order to 
follow the developing logic of the course. It 
is generally agreed that once a student falls 
behind in his mathematics assignments, it 
becomes difficult for him to catch up with 
the class. The psychology course, on the 
other hand, can be mastered by a reading of 
the textbook at reasonable intervals. With 
few exceptions, the quizzes and final test in 
psychology are taken directly from the text- 
book, As a result, class notes and attendance 
do not appear crucial for obtaining a satis- 
factory psychology grade. Furthermore, the 
interdependence between various sections of 
the psychology course is only slight, so that 
reading the section of the textbook on per- 
sonality theory, for instance, does not pre- 
suppose much knowledge of learning theory 
which had been taught earlier in the term. 

The task requirements of mathematics then 
appear to require more sustained effort by 
the student and fewer absences from class 
than do the task requirements of psychology. 
Such requirements seem less consonant with 
the behaviors typically shown by high im- 
mature students. While no data were available 
on this point, the author would speculate 
that high immature students’ underachieve- 
ment in mathematics resulted from their 
spending less time in study outside of the 
classroom and being absent from class more 
often than middle immature or low immature 
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students. More generally, these findings 
point to the possibility that high immature 
individuals will underachieve in courses in 
which (a) the units of information are highly 
interdependent so that progress in the latter 
stages depends upon having learned the earlier 
stages of information, and (5) the course 
requires sustained effort by the student both 
in class and in doing homework. 
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INTERFACE BETWEEN PERSONNEL AND 
ORGANIZATIONAL PSYCHOLOGY 


BERNARD M. BASS 


Management Research Center, Graduate School of Business, University of Pittsburgh 


The Ist part of this report delineates the importance of organizational 
considerations to the personnel psychologist as he concentrates on recruiting. 


selection, training, job design, 


and so forth. The 2nd part deals with the 


converse: the importance of personnel considerations to the organizational 


psychologist as he deals with problems of morale, supervision, 


teamwork, 


organizational design, and conflict resolution. 


Among the Inca farmers, work was re- 
garded as an end in itself, a ritualistic form 
of religious worship. But work that was not 
ceremonial Jacked sense and meaning. Work 
and worship were inseparable (Von Hagen, 
1957). Nevertheless, more often in non- 
industrialized societies, work was done merely 
to subsist. Better yet, it was delegated to 
women or slaves. The Tibetan concentrated 
on his prayer wheel; the Talmudist on study- 
ing the Law; the Balinese on art; the Athen- 
ian on citizenship; and the Iroquois on mili- 
tary honors. Success, well-being, security, and 
prestige depended not on one’s job but on 
contemplative zeal, scholarship, artistic efforts, 
oratory, or bravery in battle. 

On the other hand, in modern industrial- 
ized society, as among the Incas, work is 
becoming the center of life. Twenty years may 
be spent preparing to enter the world of 
work. As much as half of each working day 
will be spent at work or commuting to and 
from it. Success in life, sense of well-being, 
sense of accomplishment, security, and pres- 
tige all will be tied to what work is done. 
Likewise, work will determine one’s standard 
of living and economic well-being. Therefore it 
should come as no surprise to find that in our 
“work-intoxicated” society there are few con- 
siderations about employment recruiting, test- 
ing, hiring, training, job design, and other 
personnel problems that can be divorced from 
their setting within a larger context of social 
forces, organization, and society. š 

More generally, as Porter (1966) noted in 
his recent annual review chapter on person 
nel management, few issues in personnel Psy- 
chology can be completely examined without 
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attention to what usually are thought to be 
problems of organizational psychology. It may 
be profitable at this time to look at some of 
the more lively issues that lie at the interface 
between the two fields. Let us examine some 
organizational questions that appear when 
dealing with what usually are the concern of 
technicians in personnel psychology. To do 
this, I will focus briefly on recruiting, the 
application blank, testing, interviewing, train- 
ing, and job design. Following this I will 
reverse the examination and look briefly at 
some significant personnel questions, ques- 
tions of individual differences, which usually 
are the concern of organizational technicians. 


PERSONNEL IssuES INVOLVING 
ORGANIZATIONAL QUESTIONS 


Recruiting 


In one large sales organization, approxi- 
mately 3% of those applicants for jobs as 
salesmen were hired if they had been at- 
tracted to apply for the jobs through news- 
paper ads. On the other hand, 14% of those 
who had made application because of friends’ 
suggestions or who had been recommended to 
the sales agency by friends were hired. Un- 
fortunately, analyses of the merit ratings a 
afterwards of those hired found that 


year 
those who had been recruited through news- 
paper ads were in the fiftieth percentile in 


merit while those who had been introduced to 
the organization through friends were likely 
to be found below the twentieth percentile in 
merit. What had been revealed was quite sim- 
ple to a technician. Different selection ratios 
were operating. 
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Those recruited by newpaper 
ments were screened more carefully; pro- 
portionately fewer were hired. Those recom- 
mended by friends were screened lew 
oughly; proportionately more were hired. 
same was seen quite differently 


by the social psychologist. For him, inter- 


overriding importance to the utility of a se- 
lection program of the selection ratio as well 
as the importance of the quality and quantity 
of recruits who apply for testing. There is 
little systematic evidence of the many biases 
involved in decisions about where to recruit 
and whom to recruit for testing. For example, 
in academia, we seldom try for a low selection 
ratio. Rather, we solicit applications for aca- 
demic jobs from only the most prestigeful 
sources, neglecting the utilities of uncovering 
a larger number of recruits. The problem is 
complicated further when the recruits gravi- 
tate toward or away from certain occupations 
and organizations because of extraneous fac- 
tors. Consider how difficult it is to hire Ne- 


since tradi- 
tionally it has been so difficult for them to 
enter such a career. 


the interplay between applicant and organi- 
zation). When combined, these two items 
yielded a phenomenal validity of .70 against 
subsequent merit ratings. Hindsight made it 
relatively easy to understand why these two 
items were such powerful predictors of subse- 
quent job performance. These items were: “Is 
it all right to contact your boss?” and “How 
soon can you start work?” Applicants who 
subsequently were meritorious on the job 
gave an unqualified “yes” in answer to the 
first question. It was perfectly all right to 
contact their boss, Applicants who did not 


turn out well on the job either said “no” or 

but. . . Tho who turned into poor 
performers were ready to start work immedi. 
ately, while those who insisted on at least 
2-weeks notice were likely to be better sales- 
men, subsequently. Probably, what was being 
assessed by these dynamic items was the sense 
of responsibility and sense of security of 
applicants. 


The increasing attention we are giving to 
biographical information blanks points to a 
likelihood of focusing more fully on the social 
and organizational history of the appliant, for 
so much of what is contained in biographical 
information is of this form. Consider some of 
the factors which emerged in Cassen's (1966) 
factor analyses of biographical information 
blank items in three cultures: upward mobil- 
ity, family attitudes, achievement by conform- 
ing, interpersonal relations, etc. 


Testing 

There has been a merger of interest of so- 
cial and personnel psychologists concerning 
attitudes toward work. There seems to be no 
simple response to the question “Why men 
work?” Much research suggests quite strongly 
that what may be rewarding for the task- 
oriented worker may be punishing to the inter- 
action-oriented worker (Bass, 1967b). Non- 
cognitive dimensions of particular importance 
for selection and placement testing lie close 
to work itself or to organizational considera- 
tions. Heretofore, to a considerable degree, 
personality and motivation assessment have 
developed within and clinical 
psychology and have been lifted bodily for 
application to industry. There has been more 
concern about neuroticism than there has been 
for need for achievement, yet there is likely 
to be more payoff from concentrating on the 
latter. Even in the case of hospitalized mental 
patients, it seems more prognostic in voca- 
tional rehabilitation efforts for them to know 
about how task oriented they are than how 
neurotic they are (Distefano & Pryer, 1964). 

There has been little further exploration so 
far of some intriguing evidence that ability 
and aptitude tests tend to be more predictive 
of early success in organizations while interest 
and personality factors tend only to become 
useful for prediction of criteria where the 


f- 


i criteria are measured much later le the ap- 
Pikantes subsequent career. This proposition 

l as well as others which are being generated 
about the dynamic nature of criteria cannot 
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help but result in a more sophisticated ap- 
| proach to forecasting using paychometric 
Measurement 


* 

Early in the history of industrial psychol- 
ogy, the miniature test was devised for selec- 
tion purposes. Replicas of the real-life job 
were constructed and performance evaluated 
these simulated situations, Now we se 
more use of such simulations of organi- 
for assessment purposes. Such aimo- 
can be accomplished through the in- 
„ technique, leaderless discussions, 
games, organizational design prob- 
and the like, To date, there has been 
i ly little exploration of the possibili- 
of creating for assessment purposes rela- 

tively simple games, cach representing impor- 

> tant organizational dilemmas with which to 

"confront applicants who are seeking careers 
in those organizations. 

Culture fair testing is still another critical 
issue lying between personnel and social psy- 
chology. Is culture fair testing even theoreti- 
cally possible if past behavior is the best pre- 
dictor of future behavior? Is it possible to 
rule out cultural influences in testing without 
eliminating the validity of the tests? To what 
extent does previous organizational history 
play a role in test responses? How different is 
the applicant of today who grows up in a 
school system where the administration of 
< Objective tests is an almost weekly routine 

_ from the applicant of the last generation for 
whom objective testing was more of a novelty? 
4 s long as there is a basic mistrust between 
the applicant and the organization, we will 
live in a situation where the sensible appli- 
cant attempts to distort and fake his results 
on noncognitive tests. In what ways 
sufficient trust be promoted between appli- 
cant and organization to increase the 
cant’s desire to give honest responses rather 
‘than socially desirable ones, or responses 
which he judges to be ideal or like those of 
“his boss or like those that would land him the 
ob? 

Still another issue in testing which may 
nvolve public and organizational policy ques- 
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results may be handled more adequately in 
the future by better statistics. For instance, 
more attention to modifier variables may be 
required. Actually, modifiers may be merely 
symptoms of curvilinearity in the predictors. 
Under such circumstances, a Bayesian solution 
will always yield more accurate decisions, and 
in the last analysis will match the regression 
decisions when the data is linear (Clampett, 
1966). At best, Sawyer’s (1966) comprehen- 
sive review of research on methods for com- 
bining assessments strongly indicates that 
predictive efficiency is increased when inter- 
viewer judgments are mechanically combined 
or synthesized with test data to yield final 
predictions in contrast to following the usual 
practice of having interviewers make clinical 
judgments about the job applicant from his 
test scores and interview performance. 

The problem and its solutions therefore 
may be stated as follows: no matter what, 
interviews will be held with applicants. To 
insure that the interviews do as little damage 
as possible to the accuracy of the entire selec- 
tion process, it may be best to increase the 
number of recruits who are tested, thereby 
reducing the selection ratio and at the same 
time increasing the likelihood that all those 
who passed the first test screen, and as a 
consequence receive interviews, are all fairly 
good prospects for the job in question. At the 
same time, the interviewer’s judgments should 
be treated like additional measurements to be 
combined statistically with test data available 
on the applicant to provide a final prediction 
for him. This required change in practice is 
an organizational question since the inter- 
viewer’s role is reduced in importance. Re- 
sistance is likely particularly if the inter- 
viewer has considerable status or experience 
but in its resolution may lie the major por- 
tion of the utility of a selection program. 
Training 

Commenting on the fads and fashions in 
training is like commenting about the weather 
and what can be done about it. Nevertheless, 
organizational matters are vital in the deci- 
sions of training programs to be introduced, 
continued, or abandoned. Let me enumerate 
some well-known training problems at the in- 
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terface of personnel and organizational psy- 
chology. 

1. Trainers have discovered to their cha- 
grin that it is one thing to maintain an ef- 
fective program which meets its training ob- 
jectives, but it is another to create a training 
program which both meets its objectives and 
obtains the approval of the trainees. Unfor- 
tunately, evaluation and continuation of the 
program are more likely to rest upon the latter 
rather than the former. 

2. Familiar also is the organizational prob- 
lem associated with the returning trainee. 
Fleishman (1953) provided sufficient evidence 
that whether or not the effects of the train- 
ing program made their appearance on the 
job 6 months after training depended primar- 
ily on the attitudes of the boss and the 
climate of the organization to which the 
trainee returned. 

3. The professional and the manager need 
to see themselves engaged in “life-long 
learning” to avoid obsolescence. The organi- 
zation can help considerably to maintain ex- 
pectations that its members must continue to 
keep up with new developments. 

4. To a considerable extent much more 
attention may need to be paid to a basic 
conflict between management trainer and 
management trainee. Managers see themselves 
performing a complex art whereas one may 
be trying to teach them simple science. A 
recent unpublished experiment by Alex Ba- 
velas is relevant here and illustrates the tre- 
mendous difficulties involved in the training 
of people under these circumstances. One sub- 
ject learns to discriminate the slide photos of 
healthy and sick cellular tissue through ap- 
propriate reinforcement of his responses. A 
second subject receives the same reinforce- 
ment schedule regardless of what responses 
he makes. The first subject forms a few sim- 
ple hypotheses about what differentiates pho- 
tos of a healthy and a sick cell; the second 
subject forms a complex set of hypotheses 
since his reinforcements have not been asso- 
ciated in any simple way with his different 
responses. Unfortunately, when the two sub- 
jects discuss the matter, the one with the 
complex art of judgment is more confident, 
more resistant to change, and less readily in- 


than the other subject with the sim- 
hypotheses. 

Job Design 

_ As the organization introduces automation, 
there is a flattening of the distribution of 
skill demands on newly created jobs and the 
‘remaining old ones. More routine jobs as 
well as more skilled jobs emerge while those 
at intermediate levels of skill demand are 
“abolished. One can only vaguely foresee the 
creation of less-fluid worker castes, the un- 
skilled, and the elite, unless some effort is 
‘made to intervene with designs for more func- 
tional intermediate-level jobs. For instance, 
electronic data processing (EDP) has created 
the elite programming job. It has also created 
the unskilled job of pressing bent IBM cards 
which are returned by mail. 

At the level of the manager, there seem to 
be two diametrically opposed predictions 
about how EDP is changing the organization 

of the future and the jobs within it. On the 
one hand Leavitt and Whisler (1958) see 
greater centralization decreasing responsibili- 
dies accorded middle managers; others see 
the reverse. Selection and training of future 
middle managers need to take account of 
EDP, but how to do so in the face of the 
uncertain effects of EDP is a dilemma we 
face at this time. Individual differences play 
an important role here. For instance, younger 
men at the same level in the organization see 
that their own jobs are changing as a conse- 
quence of EDP much more so than do older 
men at the same level of management 
(Vaughan & Porat, 1967). 


Compensation 


For too long the questions concerning 
equitable compensation have been left in the 
hands of organizational rationalists. We are 
just beginning to see the full extent of indi- 
vidual differences in attitudes toward com- 
pensation. For instance, in a simulation we 
use for training managers, the managers must 
award salary increases to 10 engineers each 
ol whom differs in merit and in other at- 
tributes. Each has a job somewhat different 
than those of the others who are to be as- 
signed salary increases. Wide differences ap- 
Pear in the average increases recommended. 
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North Europeans seem to feel that 4-695 is 
equitable, while South Europeans may push 
for as high as 36%. Obviously, differences in 
the rate of inflation in the economies of the 
different countries in question are of conse- 
quence, But just as important are the atti- 
tudes toward technicians and engineers that 
vary greatly from one country to the next. 
In this same exercise, American, British, and 
Irish managers generally seem unmoved by 
the possibility of losing good men to competi- 
tion and do not feel that counteroffers should 
be the basis for salary increases. In fact, 
they sometimes tend to punish men who 
receive offers from other firms. Contrarily, 
Flemings, Norwegians, Italians, Indians, and 
Latin-Americans tend to be more prone to 
award relatively large increases to men with 
counteroffers. Even among managers from the 
same firm, we can see some opting for ex- 
tremely complex differentials while others in- 
sist that regardless of merit or job or senior- 
ity, all engineers ought to receive the exact 
salary increase (Bass, 1967a). 


Criteria of Individual Performance 


Concern for an employee's performance in 
a firm must be viewed in terms of his firm's 
objectives. For example, Smith sells more 
merchandise faster than Jones, but Smith's 
customers complain more about their pur- 
chases. Smith brings in more new, nonrepeat 
business, but Jones has more steady, satisfied 
customers. Who is the better salesman? The 
answer on the firm’s goals on what 
it values most. Smith is the better salesman 
if the firm is concerned most about its cur- 
rent share of the market. Jones is the better 
salesman if the firm is concerned about its 
long-term standing in the market. 

We must understand something about how 
to assess organizational worth if we are to 
appreciate how individual personnel differ in 
their contributions to it. Individual difer- 
ences in the proficiency of executives, for 
instance, can only be made meaningful if we 
understand the purposes of their jobs. This 
in turn entails determining the purposes of 
the department’s divisions and ultimately the 
purpose of their enterprise (Fiske, 1951). 
Assessing the adequacy of the performance of 
a manager can be tricky business indeed. As 
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Shartle (1956) pointed out, one may discover 
10 years too late that an executive who con- 
tributed the most to the firm had been dis- 
charged. 


ORGANIZATIONAL Issues INVOLVING 
PERSONNEL QUESTIONS 


We have seen that the problems of recruit- 
ing, application forms, testing, interviewing, 
training, job design, and compensation, all 
ordinarily the primary interest of the per- 
sonnel psychologist, are likely to contain so- 
cial issues of interest to organizational psy- 
chologists, as well. Now let us reverse roles 
and look at some problems which usually are 
the concern of organizational technicians 
which nevertheless contain significant person- 
nel questions as well: supervision, communi- 
cations, conflict resolution, team composition, 
and organization design. 


Supervision 

For 25 years, there has been major interest 
by social psychologists in the utility of demo- 
cratic or permissive rather than directive 
supervision. But evidence is continuing to 
accumulate that what type of supervision 
works best is often a matter of individual and 
cultural differences. In a number of studies, 
more directive approaches are favored by 
subordinates. Indeed, much direction is more 
often expected in many locales. For instance, 
in a recent pilot study completed in Spain, 
those who acted as subordinates were much 
more favorably disposed toward supervisors 
who attempted to persuade them rather than 
who attempted to share the decisions with 
them, for, they said, a supervisor who does 
not try to influence his subordinates fails to 
accord the subordinates the dignity to which 
they are accustomed by showing that he really 
does not have sufficient interest to bringing 
them over to his position. Numerous studies 
in the United States suggest that those who 
are highly authoritarian prefer in turn to sub- 
mit to authority rather than to operate in an 
environment with opportunities for sharing 
decisions with their superiors. In turn, they 
expect to make decisions for their own sub- 
ordinates (Bass, 1965). 


Communications 


In a similar vein, two-way communication 
is thought more effective as well as more sat- 
isfying than one-way communication. Yet, a 
group of Japanese frustrated an American 
management trainer who was trying to dem- 
onstrate the differences and relatively greater 
values of two-way communication. Among 
the Japanese, with whom he was working, the 
ease of communicating a pattern of rectangles 
one-way was somewhat greater for the task 
which was imposed, and as a consequence 
there was considerable confusion on the part 
of the students who were ostensibly being 
taught and shown the value of two-way over 
one-way communication, We have had simi- 
lar experiences with engineering students who 
if they are communicating the pattern of 
rectangles to others with a great deal of 
background in mechanical drawing and blue- 
print reading, may communicate faster and 
more effectively one-way than two-way. Given 
individuals with common codes, one-way 
communication can be more effective and 
more satisfying than two-way communication. 


Conflict Resolution 


Some zero-sum games where one party wins 
only under the condition that the other party 
loses are situations where conflict resolution 
is impossible for almost anybody. In some 
mixed-motive games, resolution will or will 
not occur, depending on who is playing the 
game. If one is able to compete with a gen- 
erous opponent, it is possible to fleece him. If 
one himself is nurturant, speedy and highly 
satisfying resolutions can be achieved 
(Loomis, 1959). There would seem to be con- 
siderable payoff in the study of the interac- 
tion of person and structural conditions in 
various types of negotiating situations. Thus, 
Bass (1966) found that task-oriented nego- 
tiators were most likely to achieve settle- 
ments of high quality in contrast to settle- 
ments reached by less task-oriented negoti- 
ators. 


Team Composition 


We are beginning to understand the per- 
sonal ingredients in the assembly of members 
of a group required to achieve particular 
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outcomes. For example, evidence accumu- 
lated so far suggests if we put together a 
number of highly task-oriented members, we 
‘are likely to generate a great deal of socio- 
emotional conflict, although at the same time 
‘the chances are reasonably good that plenty 
‘of work will get done. At the same time if we 
mix together a group of primarily interaction- 
oriented personnel, the likelihood is that the 
group will be most satisfying to the members, 
there will be much play, but relatively little 
‘work will be accomplished (Bass, 1967b). 
Organizational Design 

To conclude, let us look at what rational 
organizational designers would suggest as in- 
violate principles of organizational design and 
nat the same time note the very opposite points 
ol view voiced by behavioralists. 

The rationalists would say that an organi- 
zation should be designed so that someone 
is responsible for supervising all essential ac- 
tivities. The behavioralists would be primarily 
concerned with creating structures where 
leadership was shared. 

The rationalists would argue that responsi- 
“bility for specific acts should not be dupli- 
cated or overlapping. The behavioralists 
would feel that overlapping had much merit, 
providing opportunities for cross-training, 
backup, and increasing the reliability of the 

System of interacting workers. 
The rationalists would argue for job simpli- 
fication; the behavioralists would push for 
the reverse, job enlargement. 
. The rationalists would insist that responsi- 
bilities should be written, clear, and under- 
Stood by job occupants. The behavioralists 
Would argue that each person brings to a job 
somewhat different potentials and should be 
given freedom and flexibility to develop his 
own particular way of doing the job to make 
the most of the situation in which he finds 
himself. 

The rationalists would argue that author- 
_ ity to make decisions should be commensurate 
with responsibility for those decisions. The 
behavioralists would state that authority can- 
not be assigned but rather goes with ability 
and esteem as an individual among associates. 
Furthermore, they would note that the ra- 
tionalists’ principle of authority primarily 
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makes it easy for some people to shirk 
sponsibility and also increases 
flict. 

The rationalists would want authority to be 
delegated so that decisions take place as 
close as possible to the point of action. The 
behavioralists would insist that the rational- 
ists do not mean what they say, for to ac- 
complish this goal they would accept the be- 
havioralists’ position that individuals should 
be responsible for decisions which affect 
themselves, whereas, in fact, the rationalists 
design organizations so that such decisions are 
lodged with the superiors of the individuals 
who must execute those decisions, - 

The rationalists have fixed notions about 
span of control; the behavioralists feel that 
the span of control creates as many problems 
as it solves. For example, the smaller the span 
of control in an organization the taller will be 
its structure resulting in greater separation 
of the top of the organization from the bot- 
tom. More possibilities will arise for filtering 
communications that must be transmitted 
through the organization. There will be greater 
differences between the goals of those at the 
top from those at the bottom of the hierarchy. 
Rather than concern themselves about span 
of control, the behavioralists would be much 
more interested in the opportunities for feed- 
back between subordinate and superior; span 
of control could be much greater where feed- 
back was accurate and easy. 

The rationalists insist on a chain of com- 
mand: the behavioralists say the chain of 
command is often a fiction interfering with 
the flow of communication that is needed by 
the organization, a flow which may be hori- 
zontal, diagonal, and in forms quite different 
than assumed by the rationalists. For exam- 
ple, the relation between the foreman and an 
assembly-line worker has been likened to that 
between a travel consultant and his client. 
The “real boss” in the situation is the ma- 
chine (Bass, 1965). 

Optimum organizational designs are likely 
to be found somewhere between rationalist 
emphasis on predictability and accountability 
through impersonal structure and behavioral- 
ist concern for interpersonal trust and inter- 
personal confidence as the bases of organiza- 
tional stability and growth. Again, one can 
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only agree that organizational design must 
attend to what has been learned about intra- 
personal as well as interpersonal dynamics. 
Where the optimum lies between the ration- 
alist’s and the behavioralist’s positions de- 
pends on the capabilities, training, and in- 
volvement of the personnel in the system. The 
fully programmed rationalist’s organization is 
closer to the optimum when personnel capa- 
bility, training, and involvement are low (and 
the rationalist’s organization is likely to keep 
personnel involvement low). When personnel 
potential is high, the behavioralist’s model 
becomes more feasible. 

The personnel specialist no longer can hide 
behind his validity coefficients, test blanks, 
and training manuals. In turn, the organiza- 
tion scientist cannot remain assured of the 
generality of results from his surveys, field 
studies, and rational analyses. Models for 
organizational and personnel research need 
to take into account sources of variance due 
to jobs and organizational environment, 
sources due to individual characteristics, and, 
most important of all, sources due to the pe- 
culiar interactions of individuals and environ- 
ments. 
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EXPLORATIONS OF A THEORY OF VOCATIONAL CHOICE: 


VI. A LONGITUDINAL STUDY USING A SAMPLE OF 
TYPICAL COLLEGE STUDENTS 


JOHN L. HOLLAND! 


American College Testing Program 


Some hypotheses in Holland’s theory were tested in a longitudinal study using 
large, diverse samples of college students, Using the appropriate operational 
definitions, students were classified as personality types and personality pat- 
terns and then examined to learn if they exhibited the characteristics suggested 
by the theory. This testing of concurrent and predictive relationships based 
on the theoretical formulations was generally positive. The testing of college 
influences upon vocational choice and satisfaction was accomplished by char- 
acterizing colleges by the Environmental Assessment Technique and students as 
personality types. These analyses of student-college interactions were partially 
successful. Taken together, the results support the usefulness of the theory. 


and self-ratings (Holland, 1963-64). The fifth 
taken to explore the usefulness of a theory report was another longitudinal study (Hol- 
of vocational choice and personality by test. land, 1964) that replicated some older hy- 
ing some of its hypotheses (Holland, 1959, potheses and tested some new and more re- 
1966b). The first and second monographs mote hypotheses. It also fostered several re- 
were longitudinal studies of bright students visions of the theory (Holland, 1959) and 


This report is one of several studies under- 


1 over 1-4 yr. periods (Holland, 1962, 1963). emphasized the need for a more explicit and 
The third study was a longitudinal study of comprehensive statement. Such a statement 
change in major field plans (Holland & has since been published (Holland, 1966b). 

Nichols, 1964). The fourth study examined Although these earlier studies support the 
some of the concurrent relationships between theory, they were all based on samples of 
vocational preferences, occupational images, high-aptitude high school seniors or college 
r i students. The purpose of the present report 
for ET pe 11 13 8 E Te was to test some of the hypotheses in the re- 
execution of this study: Thomas T. Frantz, Sandra statement of the theory and to do so in a 
W. Lutz, James M. Richards, Jr, Wiliam P. Alston, large student sample approximating the typi- 
Theodore Cooper, Leonard P. Rand, and Lorraine cal college student. Consequently, this ex- 
M. Rand. He is also indebted to Alexander W. amination of the theory’s hypotheses should 
nA , Leonard L, Baird, and D onald F. Fort 155 be more explicit, and the findings should be 
eee reading) e eee applicable to a more general population. 
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TABLE 1 
Nauru OF STUDENTS AND COLLEGES 


College Mak | Female 
Spring sampbe 

University of Alabama is 201 
Arkannas Polytechnic S n $2 
California State College at Hayward 6 “ 
Colorado State College a 107 
Southern Connectiout State College * 3733 
Wesleyan University (Connecticut) “j| — 

Indiana State University 6 164 
Burtington Community College (lows) ” i 
Kansas State University m| 216 
University of Kentucky 1» 182 
Westbrook Junior C (Maine) — 103 
William Jewell College (Missouri) "N w 
Plymouth State College (New Hass pabire) 86 


University of North I 


121 
Southeastern State C Oklahoma) 7 or 
Mount Mercy, er Sgi Aer 104 
Swarthmore — I M w 
Black Hills State (South Dakota) 60 38 
University of Tennessee 20$ 184 
8 (Texas) 64] 101 
F: t State College (West Virginia) 93 RS 
Total 2087 | 2662 
Fall sample 
California State A at Hayward 69 102 
Chico State — 1 (California) 10 173 
Amherst husetts) 238 — 
Uni of Massachusetts 739) 875 
Cuyahoga Community Feber TOi iso ise 
wr ommunſty ) 1 
‘otal 1576 | 1571 


Briefly, the theory consists of several sim- 
ple ideas and their more complex elaborations. 
First, we assume that we can characterize 
people by their resemblance to one or more 
personality types. The closer a person’s re- 
semblance to a particular type, the more likely 
it is that he will exhibit the personal traits 
and behaviors associated with that type. Sec- 
ond, we assume that the environments in 
which people live can be characterized by 
their resemblance to one or more model en- 
vironments. Finally, we assume that the pair- 
ing of persons and environments leads to 
several outcomes that we can predict and 
understand from our knowledge of the per- 
sonality types and the environmental models. 
These outcomes include vocational choice, 
vocational stability and achievement, personal 
stability, creative performance, and suscepti- 
bility to influence. These ideas, along with 
some operational definitions, are elaborated in 
The Psychology of Vocational Choice (Hol- 
land, 1966b). The following sections sum- 
marize the methods of collecting and analyzing 
the data and describe the findings. 


METHODS or AssEssMENT 


The data for the present study come from two 
American College Surveys described earlier by 


Joun L. Horraxo 


Richards, Holland, and Lutz (1966). Students were 
polled for their vocational aspirations and admi». 
istered several scales and inventories. About 8 or u 
mo. later, students were polled again for their aspir- 
tions and satisfactions 

Students came from two college samples. The 
freshmen in the fall sample of six colleges were 
polled in the fall of 1964 and in May of les an 
8-mo. interval. The spring sample of college fresh- 
men was polled in May of 1964 and again in May 
of 1965 when they were sophomores—a 12-mo. is- 
terval. The spring sample included 22 colleges. Both 
samples contained students with a great range of 
scholastic potential, vocational interests, and socio- 
economic status. Table 1 shows the samples of 
colleges and students. 

Because the follow-up data were obtained for only 
39% of the spring sample and 44% of the {all sam- 
ple, it was necessary to examine the differences be- 
tween students with complete pre- and postdata and 
students with only initial data. Comparisons of 
students with complete and incomplete data in an 
earlier study using the same sample revealed that 18 
out of 44 comparisons were significantly different 
for a variety of academic and nonacademic measures, 
but such mean differences always were less than one- 
quarter of a standard deviation (Richards ct al, 
1966). In the present study, comparisons of students 
with complete and incomplete data on the Voca- 
tional Preference Inventory scales, the most relevant 
variables in the present study, reveal that all mean 
differences are less than one-sixth of a standard 
deviation. The distributions of vocational choices 
are also similar, In short, there is no compelling 
evidence that the results would be substantially dif- 
ferent if we had obtained complete data on the 
entire sample, although this occurrence is always pos- 
sible. More important, attrition of the original sam- 
ple is of little relevance because the main concern of 
the study is the testing of theoretical relationships in 
the same sample rather than the estimating of 
parameters for the original population. 

A description of the assessment devices used in 
this study follows. The reliability estimates are based 
on the original spring samples of 6,289 male and 
6,143 female college freshmen. 

Vocational Preference Inventory (Sixth Revision). 
This personality and interest inventory is composed 
entirely of occupational titles (Holland, 1965). To 
take the inventory, a student indicates which oc- 
cupations he likes and which he dislikes. Scores on 
only the following scales were used for this study: 
Realistic, Intellectual, Social, Conventional, Enter- 
prising, and Artistic. Reliabilities (Kuder-Richardson 
20) for these 14-item scales ranged from .83 to .89 
for male college freshmen and from 76 to 80 for 
females, 

Preconscious Activity Scale. This scale is an a 
priori scale developed to measure Kubie’s (1958) 
notion of preconscious activity as a process in crea- 
tive performance (Holland & Baird, in press). The 
Preconscious Activity Scale is a 38-item, true-false 
scale with reliabilities (K-R 20) of .72 and 68 for 
male and female college freshmen. The predictive 


hes of this scale and its concurrent relstion- 
with originality and interest measures imply 
bat the Preconscious Activity Scale should be in- 
as an originality measure, especially in the 
of art, literature, and music 
je oj Competencies. Using a list of activities, 
is checked the activities which they claimed 
ey could do well or competently, The assumption 
ng these scales is that a large number of 
mmpetencies is conducive to achievement generally 
that competencies in a particular field are con- 
to achievement in that field. Typical items 
À this list include: I have a working knowledge 
Roberis’ Rules oj Order, I can dance, I am a good 
ok, I can make jewelry, I can read blueprints, 1 
read Greek, I can operate a tractor, I can use 
m tables, etc. Competencies were categorized 
y three judges into several areas of competence: 
fic, technical, business and clerical, social and 
tational, arts, and leadership 
init weights, students were then scored 
J of competency. The reliability for the 
scales used in the present study ranged from 
48 to 87 and from 38 to 8S for and women. 
very low reliabilities for a few scales appear to 
from the small number of items in such scales. 
Interpersonal Competency Scale. This 20-item, a 
priori scale was modeled after the work of Foote and 
(1955), who defined in com- 
ce as “acquired ability for effective interaction,” 


petencies, positive self-regard. The reliability (K-R 
20) of the Interpersonal Competency Scale for male 
‘and female freshmen was .69 and 67, respectively. 
a Dogmatism Scale. This scale, developed by Rokeach 
(1956) to measure dogmatic and rigid thinking, con- 
"sists of 40 truc-false items dealing with beliefs and 
Attitudes. (The first version by Rokeach is in mul- 
' tiple-choice form.) The reliability (K-R 20) was .77 
~ for male freshmen and .75 for female freshmen. 
Student Orientation Survey, Form C. Farber and 
Goodstein (1964) developed four a priori scales to 
| assess the student orientations implied in Trow's 
(4960) student typology. These scales are 
~ Collegiate, Non-Conforming, and Vocational. The a 
= priori scales were revised by an internal consistency 
P item analysis to 10-item 
scales, Reliabilities (K-R 20) ranged from 39 to 4S 


Other Descriptive Information. Students were 
polled for their educational 
b tions, their life goals, and their self-ratings. They 
© were also asked to indicate their choice of vocation 
_ and field of training and to provide in- 

formation. 

eh Students indicated the degree to 

lle goals and achievements were “essential, very im- 
portant, somewhat important, Or of little impor- 
tance” (e.g., being a religious person, making a con- 


Turoxy op Vocationat Chr 3 


tribetion io scientific ksewinipn big happy osd 


content). 

a Mist of traits and abilities sach as orighral- 
kign Hea asd comervetion, stedents reind 
their personal traits and abilities os a -Feed sale 
top 10%, above average, average, and Below sverzge 


a student’s VPI scale scores. 
Student Traits and Orientations 

In Tables 2-7, students have been das- 
sified by their highest VPI scale score, their 


l 
at 
f 
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TABLE 2—Continued 


| Highest VPI score | 
Student characteristics Real Int Soc Conv Ent Art F 
(N= | N= | (N= | (N= | (N= | (N= 
| 885) | 1774) | 845) | 428) 798) 630) 
E tHe x | 307| 325| 3.14 295| 338| 374 | 16.54 
r ype SD 1.69 | 1.74] 1.71 | 159] 1.65] 1.85 
Sic 5 x | 435 | 4.37 | 4.56 | 4.59 | 5.05 | 417 | 20.68 
legia . k 
Collegiate Ee sp | 138 | 192| 183 | 184| 180] 201 
Interpersonal competency 55 na | ps 2 ioe 5 n | 8 85 | us 28.68 


Note. All Fs are significant beyond the .0S level and the majority are significant beyond the .01 level, In Table 2, an F 


uals the .01 level. 
for various VPI codes or scales in Tables 


The 2 


2-7 are larger than the same codes or scales in all other tables, because 


Tables 2-7 contain data for students with and without follow-up data. All other tables are based only on data for students with 


pre- and postdata. 


Italicized means indicate theoretical predictions. If prediction is correct, italicized mean should be the highest X. 


Abbreviations: Real or R= 
= Enterprising; Art or A = Artistic. 


though the highest mean is not always sig- 
nificantly different from other means for the 
same characteristic, 19 out of 25 predictions 
(76%) accord with the theoretical models. 

Table 3 presents the student characteristics 
that go with a student’s highest and second 
highest VPI scale scores. This table clearly 
indicates that students with the same high- 
point scale can still be distinguished by their 
second highest scale. In Table 3, for instance, 
men with a Realistic-Intellectual pattern are 
distinguished by their scientific competencies 
and goals of developing a useful product and 
scientific theory, whereas men with a Real- 
istic-Social pattern are characterized by their 
social and educational competencies. In all, 
69 out of 92 predictions (75%) are con- 
sistent with the theory. 

In Table 4, students whose first two highest 
scales were the same were compared to see if 
they could still be distinguished by their third 
highest scale. To insure more reliable out- 
comes these analyses were performed only 
for the 20 three-variable patterns with the 
largest Ns. Despite this severe test of the 
validity of the personality patterns as assessed 
by the VPI, the F tests—although smaller in 
size and significant for fewer variables—reveal 
that the third scale still distinguishes one 
subgroup from another and usually in ap- 
propriate ways—25 out of 39 predictions are 
correct (64.1%). 


Realistic; Int or I = Intellectual; Soc or S = 


Social; Conv or C = Conventional; Ent or E 


The results for women are given in Tables 
5, 6, and 7 for one-, two-, and three-variable 
VPI codes. In Table 5, 21 out of 25 predic- 
tions are correct (84%). In Table 6, 52 out 
of 69 predictions are correct (75.49). And in 
Table 7, 24 out of 33 predictions are correct 
(72.7%). 

The results presented in Tables 2-7 imply 
that the formulations for the personality 
types have some construct validity. Similarly, 
the interpretations of student profiles—VPI 
two-and-three-digit codes—by the application 
of the typological formulations possess similar 
validity. 


Educational Aspiration, Satisfaction, and 
Achievement 


In Tables 8, 9, and 10, the relations be- 
tween personality types (high-point VPI 
codes) and satisfaction with college, educa- 
tional aspiration, and college grades were 
examined. In Table 8, the chi-square tests 
show that high-point codes are not inde- 
pendent of a student’s satisfaction with his 
college 8 mo. after he was assessed by the 
VPI, Students rated their satisfaction with 
their college by responding to the following 
item: 

How satisfied are you with your final choice ofa 
college? 

This is the best possible college for me that I aes 
of. 
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TABLE 3 
RELATION or Two-Dicir VPI Copes to Srvupent Cuaracreristics (Mex) 


| Two-digit VPI codes 


Student characteristics ＋ | hez- A 
RI RS RC RE | RA 
(N=317)| (N=97) | (N=92) | (N=117)| (N =46) 
Competencies | | 
TASE bd ee 322 | 354 | 339 | 3.46 | 10.43 
cece sD | 240 210 | 224 | 222 2.22 
oe X | 1509 | 1455 | 1510 | 1403 | 1415 | 258 
ge SD | 3.58 341 3.25 3.52 3.98 | 
= i X| 4% 545 | 438 4.68 5.26 | 402 
Social and educational SD 281 2.60 2064 283 304 
2 * 5220 4.71 4.93 5.68 7.96 | 4.46 
a SD | 435 | 371 | 437 | 531 3.75 
Life goals i be ** 
i X 176 1.30 1.65 144 1.39 | 7.28 
Developing useful product SD 84 ‘67 77 68 40 
PRENS X 176 1.27 1.47 1.40 1.30 | 12.49 
Developing scientific theory SD 81 53 74 74 ‘31 
; - X 1.00 1.53 179 1.86 1.57 3.85 
Becoming expert in finance and commerce SD 76 0 80 88 71 
5 ; X | 253 281 2.62 2.74 2.89 2485 
Active in religious affairs sp 99 93 90 96 90 
Self. ratings 
fared. X | 297 2.58 2.82 2.57 2.52 | 5,38 
1 SD 81 88 78 79 74 
a; X | 240 2.10 2.24 2.12 204 7.44 
Papin SD 03 70 70 82 93 
Personality and attitudinal scales 
; BraS X | 15.50 | 14.93 | 1442 | 1407 | 16.96 | 4.73 
Preconscious activity (originality) SD} 460 | 411 | 424 | 434 492 
esi 413 494 | 409 4.20 474 4.67 
ap adh SD | 185 15 47a.) 0 
IR IS IC IE IA 
(N=552) | (N=307)| (N=68) | (W=142)| (W=307) 
Competencies 
Technical X | 1447 | 1148 | 12.69 | 13.53 | 1208 | 30.06 
SD 3000 4.42 4.33 4.36 4.53 
Busi 185 1.97 231 2.35 193 4881 
r le 120 170 
Social and educati X 4.81 6.37 5.69 6.37 5.44 17.6 
5 SD 2887 | 286 | 299 | 296 | 307 4 
Artistic 1 7.04 6.13 8.30 9.93 | 27.57 
SD 5.20 4.97 6.24 | 6.35 5.86 
Leadershi be ines i 466 4.21 5.51 4.61 | 12.93 
5 SD | 290 307 | 297 327 | 313 
Life goals 
Develop ful x 1.86 1.61 1.63 1.69 1.74 5.37 
Sr al Set DEY SD 85 70 77 70 83 
Developing scientific theory 35 . 5 ay 1 8 ae se 3.08 
Producing artwork X 1.30 1.28 1.28 1.39 1.02 11.87 
SD 64 00 56 7⁵ 87 


SN 


>- = Net Ne 


| 1897 


Becoming expert in finance and 
commerce 


Active in religious affairs = 

Executive responsible for work of others gm 
-ratings 

Originality SD 

Mechanical ability SD 

Popularity SD 

Understanding of others sD 


Math ability SD 
nality and attitudinal scales 
Preconscious activity (originality) SD 


Dogmatism SD 
Vocational type SD 
Nonconformist type SD 
Collegiate type SD 


Interpersonal competency 


Competencies 
Scientific SD 


Technical SD 
Artistic SD 
Leadership SD 


Life goals pi 
Developing scientific theory SD 


Producing artwork SD 
X 
Becoming expert in finance and commerce ç 


X 
Executive responsible for work of others 5 


123222 53 


RRSRReRRER 


234234 3 
2222 H 2232322235 5 
27883584222 


a g 


i (N= Pee! Waw 


2 288487 


3235353217 


* 


222334 34a 14348433 


ai iat- iat: bal el at kat kat kai iai *S*S 
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TABLE 3—Continued 
Two-digit VPI codes 
Student characteristics z F> 
AR AI 8 AE 
(N= 41) | (N= 145) | (N=161)| (N=6) | (N = 146) 
Self-ratings | 
Mechanical ability — nae 7 7 7 a 4 Ga 
Popularity x 2.17 | 2.41 2.41 2.00 2.54 2.48 
a SD 7 E 78 
E 7 1 2 ; 2.67 
Understanding of others 5 | 2 = Í SR | 727 eo g A as 
* 32 
Math ability |} oo) es | | eee | a 
Personality and attitudinal scales 
Preconscious activity (originality) a | 15 a ce ren wel e 
; X 451 3.86 4.04 5.00 4.53 3.13 
Voriha Sy SD | 184 | 182 | 200 | 153 | iB | 
` 3 nd 3.56 4.00 4.04 4.50 4.77 4.86 
leain ype SD | 202 184 | 207 | 2.43 1.90 
* 05 11.2 11.43 9.83 12.09 3.40 
r e eee e | so | 2S — — 


Note.—See note of Table 2. 
All F ratios are significant beyond .05 level, 


This is a good college for me, but there are a few 
others that I think are better. ................... 2 
This is only a fair college, and there are many others 
which would probably suit me better 1 


The results for men in Table 8 conform 
closely with the theoretical statement (Hol- 
land, 1966b, p. 48): satisfaction should be 
highest for the Intellectual type, and in de- 
scending order, Social, Artistic, Conventional, 
Enterprising, and Realistic. The rank-order 
correlation between the actual mean satisfac- 
tion scores and the mean expected scores 
equals .83 (p < .05, one-tailed test). At the 
extremes, the male Intellectual type is most 
satisfied with his college and the Realistic 
type is least satisfied. The results for women 
follow an unpredictable pattern and do not 
conform to expectation. 

In Table 9, student educational aspirations 
are not independent of high-point codes for 
each sex ( < .001). The two rank-order cor- 
relations (each rho=.77) between the ob- 
served and predicted medians for the six 
types are positive, but they are not significant. 
The expected level of educational aspiration 
by personality type is identical with the 
theoretical ordering reported earlier for satis- 


faction with college and also constitutes the 
expected order for level of academic achieve- 
ment in Table 10. 

In Table 10, college grades, for men, are 
not independent of personality types (% < 
05). For women, grades and types are in- 
dependent of one another (p > 05). Although 
grades, for men, are not independent of types, 
the rho of .71 between the observed and ex- 
pected median grade for each type is not sig- 
nificant. (A rho of .83 is needed for a one- 
tailed test at the 5% level.) 


Vocational Role Preferences 


In a follow-up survey, about 8 or 12 mo. 
after the administration of the VPI, students 
responded to the following item: 


What special role would you like to play within 
your first occupational choice? 


Being a practitioner of my occupation. 

Training or teaching others about my occupation. 
Leading or supervising people. 

Doing research in my field. 

Acting as a consultant or expert to others. 
Undecided. 

Other role. 


- 
oa 
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TABLE 4 
ReLation or Tuxxx-Diorr VPI Cones ro Sruoexr eee (Mex) 


— — — — — 


Student characteristics 


ducing artwork SD 


ming expert in finance and commerce on 


ings 
larity SD 


ity and attitudinal scales 
Preconscious activity (originality) D 
Collegiate type Es 
Interpersonal competency z 


npetencies 


Social and educational 5 
Artistic a 
Leadership 55 


Life goals 
Developing scientific theory SD 
Becoming expert in finance and commerce = 


Executive responsible for work of others BA 


Self-ratings - 
Originality SD 
Math ability 175 
ersonality and attitudinal scales PA 
Preconscious activity (originality) SD 
“ae X 
Vocational type SD 
2 26 
Collegiate type SD 


X 
terpersonal competency SD 
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TABLE 4—Continued 


Three-digit VPI codes 


Student characteristics F 
IAR TAS IAC IAE 
(N=93) | (N=94) (N=7) (N=49) 


Competencies 


Technical X 12.87 10.89 11.57 12.90 3.91 
ns SD 4.35 4.34 3.33 4.48 
Rü X 1.74 1.88 3.14 2.22 3.83 
TEA SD 1.29 1.14 83 1.37 
Ae : 4.55 6.37 4.71 6.37 7.31 
Social and educational SD 3.04 301 1.67 2.76 
P X 3.76 5.01 4.43 6.10 6.45 
5 SD 2.75 3.24 2.92 3.39 
Life goals 
5 1.87 1.47 1.71 1.92 5.15 
Developing useful products SD 87 65 70 92 
= 5 8 X 1.38 1.30 2.57 1.90 12.16 
Becoming expert in finance and commerce h 65 71 90⁰ 95 
i P x 1.87 2.11 2.86 2.40 6.28 
Executive responsible for work of others SD 89 94 64 97 
Self- ratings 
9 ez X 2.83 2.90 2.00 2.82 2.83 
Caponi SD ‘84 73 53 82 
t E X 2.54 2.07 2.57 2.10 5.21 
Mechanical ability SD 92 8⁴ 1.05 ‘84 
4 X 2.14 2.32 3.00 2.55 5.21 
e SD 74 60 1.69 70 
Personality and attitudinal scales 
: X 4.20 3.70 4.71 4.73 3.98 
es cout type SD 1.80 1.80 148 1.50 
X 9.76 11.00 9.43 12.37 5:65) 
Interpersonal competency SD 3.08 4.05 1.50 3.24 
| 
ISR ISC ISE ISA 
(N=84) (N=12) (N=60) (N=60) 
Competencies 
. X 1.70 1.67 2.45 1.88 3.74 
Busines sD 121 1.43 1.50 142 M 
nae X 6.30 8.08 7.17 9.25 44 
ei SD 4.40 5.47 451 5.45 
Life goals 
. X 1.18 1.42 1.12 1.53 5.34 
Produeing artwork SD 47 86 45 88 
Self-ratings 
iginal X 2.40 2.50 2.28 2.70 4.30 
ii 
Originality SD 64 65 61 7¹ 
Mechanical ability Ge 2 x 255 Ee K 2.95 
Personality and attitudinal scales 
P st nies X 17.61 17.92 16.32 21.28 12.49 
Preconscious activity (originality) SD 435 3.86 4.93 4.77 


Note. — See note of Table 2. 
a All F ratios are significant beyond .05 level. 
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TABLE 5 
RELATION OF ONE-Dicit VPI CODES To STUDENT CHARACTERISTICS (WOMEN) 


One-digit VPI codes 


Student characteristics Real Int Soc | Conv | Ent Art P 
(N= | (N= | (N= | (W= | (Nm | V= 
31) 554) | 3033) | 271) 148) | 1284) 


Competencies | 


x| 3s2| 451| 283 | 228 | 247 | 287 | 68.84 


e 5% 249 | 242| 214 | i81 | 214 | 221 
Be. R oas | 563 | 478| 490 | $26 | 497 | 19.72 
Eschnical sp| aio | 331 | 307 | 333 | 309 | 323 
x| 200 1% 203| 2% | 232 | 202 | 929 


| 
| Business 
| 


Social and educational 


ae sp| 537 534| 550| 458 | 611 | 64t 
“hs XK 3.55 4.24 5.07 4.16 5.53 4.97 | 12.75 
pene sp) 291 | 297 | 307 | 3.07 332 | 3.41 
Life goals 
Delon fad X 1.29 1.35 1.19 1.20 1.23 | 1.24 9.05 
eveloping useful product SD 45 63 50 57 85 58 
Develoci entific th Xx 142 1.74 1.19 1.16 1.21 1.23 | 85.15 
eveloping scientific theory SD 55 88 52 40 55 60 
y X| 148 1.56 1.45 1.41 1.66 1.97 | 65.66 
Producing artwork SD 84 87 70 75 94 1.06 
$ A X 1.45 1,34 1.37 | 1.85 1.72 1.33 | 31.58 
Becoming expert in finance and commerce 5% 66 66 70 1.01 91 08 
Pent ee ; 1 2.74 3.03 2.96 2.62 2.72 | 26.08 
Active in religious affairs SD 95 1.03 94 93 99 1.05 
1.84 2.01 2.14 2.32 1.85 15.84 
Executive responsible for work of others 157 ie ‘84 88 91 90 87 


Self-ratings 


Originality eee 
PI 230 135 157 17 | 1.68 163 | 21.09 

A sp| 30 17% f 78) 78 
1 226 223 | 236 | 227 2% | 233 455 

Popularity, sp| 67 65| | 66 | | 65 
258 | 271 | zes | 260 | 2.78 284 9.74 

Understanding of others SD 71 73 67 63 67 7⁰ 
D| aio 2345 | 190 | 232| 195| 186 | 38.58 

Math ability 5 82 % 8 | 97 | 83 | 88 
P| 203 | 222| 218| 291 | 220| 2.0 6.68 

Conservatism cet gel at teres: [ses !|" a] 8 
Personality and attitudinal scales y| 17.52 | 19.81 | 1812 | 1449 | 17.07 | 22.20 |215.60 

Preconscious activity (originality) sp| 4.76 461 4.49 430 494 450 
D| 1668 | 1626 | 17.05 | 1826 | 17.08 | 16.60 | 6.56 

Dogmatism sp| 650 | 532 | 553 | 530 509 | 548 
D| 4% | 500 | 457 | 427 | 439 | 487 | 10.78 

Academic hype 55 19 | 203 | 192 | 182 | 193 | 195 
P| 471 | 416 | 457 | 501 | 458 40 3024 

Vocational type SD 1.51 1.65 1.67 1.79 1.67 1.68 
„ 32 | 282 | 274 | 261 | 294 | 312 | 11.10 

Nonconformist type Sp) iS 1.55 1.58 1.53 1.88 1.74 
NojiIiIi -| 40> | 19:98 

Collegiate type sp| 176 | 199 | 197 | 195 | 197 | 2.06 
2 1006 | 10.63 | 11.78 | 10.51 | 11.51 | 11.62 | 1893 

-a Interpersonal competency sp| 3.39 | 3.30 | 3.20 | 3.35 3.45 | 3.26 


Note.—See note of Table 2. 
* All F ratios are significant beyond .05 level, 
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TABLE 6 
RELATION OF Two-Dicir VPI Copes To STUDENT CHARACTERISTICS (Woxen) 


Two-digit VPI codes 
Student characteristics | ined | aa | R 
IR 18 10 IE IA | 
(N=32) | (N=234)| (N=25) | (N=14) | (V=150) 
Competencies | 
i Xx 7.09 5.40 6.96 4.57 5.29 3.82 
Technical SD| 3.42 3.07 | 3.39 2.87 3.25 
Diei 178 2.02 2.52 2.57 1.65 5.03 
N SD 99 1.22 1.20 1.24 1.19 | 
P : ; X 4.62 7.09 6.72 8.21 6.27 7.34 
Social and educational SD 281 2.67 2.88 2.581 302 | 
PEE 5 X| 7.44 9.58 7.24 10.43 10.57 | 3.84 
5 SD| 4.41 5.57 3.95 4.84 5.31 
; 2 4.45 4.72 6.29 3.89 3.83 
Leadership SDA 2.52 3.03 2.89 2.99 2.83 
Life goals 
A Ter Ae | 1.47 1.42 1.40 2.21 1.71 5.37 
Producing artwork SD 83 78 57 94 ‘00 
5 : Kalin) 1222 1.35 1.60 1.93 1.27 4.57 
Becoming expert in finance and commerce SD 34 67 80 80 60 
5 0 ra 0 1.87 2.20 2.43 1.75 4.90 
Executive responsible for work of others SD 71 80 89 1.05 83 
Self- ratings 
. X ETTO 2.26 2.04 2.50 243 | 3.02 
3 55 75 65 77 73 73 
: 5 X 2.62 1.75 2.08 2.07 1.81 9.20 
Mechanical ability SD 80 80 ‘80 88 77 
8 X 2.09 2.20 2.16 271 2.14 3.79 
popularity Sp| 152 ‘66 Si 70 63 
i X 2.34 2.80 2.48 3.29 2.71 5.76 
Understanding of others SD 60 ‘67 75 70 77 
11 22 2.25 2.84 2.43 2.35 3.46 
Math abilit 
AAU, SD 84 04 92 1.05 1.01 
Personality and attitudinal scales 
0 aes ance X| 18.50 19.21 16.84 18.93 21.69 | 11.91 
Preconscious activity (originality) SD 415 4.56 4.44 4.33 3.95 . 
‘Academic t X 444 5.10 3.92 5.07 513 2.7 
a ine. SD| 185 1.98 1.49 1.22 2.14 
Collebiate t X 3.66 4.00 4.72 6.14 443 4.51 
5 5 207 1.93 1.95 1.88 1.92 . 
x 9.78 11.16 10.72 12.36 10.08 4.1 
Int 1 . 
5 SD| 3.67 3 3888 2.84 3.13 
SR SI sc SE SA 
(N= (N= (N= (N= (N= 
21) 444) 360) 536) 1077) 
Competencies 
Scientific * 3.10 4.00 2.31 2.54 2.73 | 43.97 
SD| 2.31 2.34 1.95 2.06 2.01 A 
Technical X| 695 5.25 5.09 4.63 4.50 | 8. 
mai SD| 3.37 3.43 3.11 2.94 2.92 a 
Busi X 2005 1.95 2.22 2.12 1.95 4.43 
oa 5 109 1.23 1.23 1.29 1.24 


| 
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TABLE 6—Continued 
Two-digit VPI codes 
Student characteristics E J 81 | sc SE A 7 P> 
ine | Wa |. We | We | ite 
21) 444) 360) 536) 1077) 
Social and educational x 7.29 7.91 7.74 8-H 8.10 4.55 
SD 2.00 2.49 2.24 242 2.36 
Artistic X 8.29 9.27 8.20 9.80 11,52 | 32,62 
SD 5.48 5.29 4.66 5.24 5.68 
Leadership x 4.90 4.95 4.71 5.40 5.18 3.19 
SD 3.10 2.97 3.06 3.09 3.06 
Life goals 
Develont X 1.19 1.24 1.15 1.16 1.19 2.38 
eveloping useful product SD 66 rm 43 44 52 
Developi 2 x 1.19 1.36 1.17 1.15 1.16 12.85 
eveloping scientific theory SD 66 68 50 40 40 
>, 5 TEN X 1.33 1.43 1.33 1.41 1.58 9.01 
Produeing artwork SD ri 77 65 75 80 
Becoming expert in finance and commerce 5 5 11 1 4 * 14.06 
ait AOE ; x| 276 | 303 | 314 | 295 | 306 | 282 
Active in religious affairs SD 97 ‘94 80 97 94 
5 k z X 1.76 2.05 2.12 2.12 1.94 5.78 
Executive responsible for work of others SD ‘81 92 80 88 8 
Self-ratings 
on, z| | 2 ao | aa | ex | om 
Mechanical ability 55 en Le mo K 5 ae 
Popularit X 2.29 2.34 2.23 2.43 2.37 5.86 
bi SD 55 65 51 65 65 
Understanding of others ko 55 5 9 af Ea a8 
Math ability FA 18 a EA 1555 es * 
c 4 X 2.10 2.23 2.23 2.10 2.20 3.68 
‘onservatism SD 53 65 61 58 60 
Personality and attitudinal scales 
s ae x uae X 15.71 18.52 15.62 16.91 19.75 | 86.67 
Preconscious activity (originality) SD 4.59 4.19 3.73 4.28 4.24 
D F X 16.14 16.95 18.21 17.21 16.91 4.30 
ogmatism SD 5.44 5.37 5.30 5.62 5.45 
z X 3.57 4.87 4.46 4.35 4.68 7.14 
Academic type SD 2.01 1.82 1.83 1.93 1.95 
F x 5.14 4.37 5.05 4.68 440 | 13.58 
Vocational type SD 1.32 1.69 1.61 1.66 1.60 
. x 2.76 2,74 2:52 2.87 2.73 2.56 
Nonconformist type SD 1.80 1.54 1.51 1.67 1.59 
a X 5.62 5.09 5.25 5.66 5.03 10.14 
Collegiate type SD 1.65 1.96 1.76 1.94 2.02 
X 11.24 11.66 11.41 12.22 11.86 4.20 
Interpersonal competency SD 3.32 3.30 3.01 3.10 3.10 
CR CI cS CE CA 
(N=3) | (V=15) (N=117)| (N=53) | (N=21) 
Competencies x 4.00 3.33 2.04 2.34 2.10 2.40 
Scientific 55 2.16 1.96 1.76 1.83 1.71 


) 
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TABLE 6—Continued 


Two-digit VPI codes 


Student characteristics ~ E | P~ 
CR a | o CE CA 
(N=3) | (N=15) |(W=117)| (N=53) | (V=21) 
SS 77 rier OAs ee | deeb 52 | ase 
Technical 5 262 | 307 | 301 2.69 | 2.69 
Life goals | 
Becoming expert in finance and commerce — a pe ns rer ne 2.85 
3 x| 3.00 2.27 3.00 | 3.00 2.71 | 2.44 
Active in religious affairs sp| "(92 ‘37 92 97 ‘98 
are S 8 Yep ss 1.73 | 2.04 2.53 229 | 4.44 
Executive responsible for work of others SD 47 77 91 88 76 
Self. ratings 
AL n X 207 1.80 1.50 1.74 1.95 | 3.83 
Mechanical ability % 47 ‘65 05 8 ‘90 
; i X 1.67 2.33 208 | 262 271 | 2.85 
Understanding of others sp| 47 47 ‘60 ‘68 ‘70 
Personality and attitudinal scales 
z ; X| 5.67 2.40 2.00 281 267 | 3.10 
Nonconformist type SD| 1.80 88 1.40 1.60 1.61 
ER EI ES EC EA 
(N=2) | (N=5) | (N=44) | (W=13) | (W=25) 
a: OE E ke! 
Competencies | 
we X 5.50 7.00 | 1098 | 9.69 | 15.24 | 3.90 
. SD} 450 3.63 | 652 5.33 5.52 
Tenhin X| 250 5.60 6.32 | 3.38 7.12 3.50 
p? SD| S0 1.62 3.54 2.87 2.86 
Life goals 
panoi X| 1.00 2.20 1.32 1.23 1.16 | 2.96 
Developing scientific theory SD 0.00 93 63 42 61 
Personality and attitudinal scales 
* X| 13.00 | 1740 | 1595 | 1492 20.66 5.67 
Preconscious activity (originality) SD| 200 5.82 | 441 5.01 3.97 
AR AI AS AC AE 
(N=13) | (N=175) | (N=773) | (N=13) | (N=118) 
Competencies 
2 * 300 | 400 2.72 3.69 2.12 | 17.30 
en SD} 200 | 255 | 206 | 284 | 187 
RES * 797 5.45 479 | 3.62 5.09 | 4.71 
Tepes SD 374 | - 3.62 305 | 2.50 3.37 
1 50 e X| 6.00 7.25 786 | 6.15 718 | 5.63 
Social and edestian „/ 2e to onal 278 
PE? X 985 | 14.79 | 1414 | 1177 | 1382 | 240 
a SD} 592 | 721 | 626 | 736 | 628 5 
Leader 0 4.60 5.06 | 3.23 5.61 3.5 
pe SD, 208 3 | 306 251 3.08 
Life goals 
Developing scientific theory a 361 1 Se 55 55 ae s. 
A ; * 1.85 2.24 192 | 200 235 
P twork j 
Weins arti oe 5 1.03 1.13 1.04 78 1.02 
Becoming expert in finance and commerce 5 927 a 1 91 ae 4.45 


TABLE 6—-Continued 
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Two-digit VFI codes i 
Student characteristics = ie we * 17 
AR Al AS | AC AE | 
(N= 13) |(N= 17S) | (N= 773)) (N= 13) | (N= 118) 
— - — f l —_———$ — — 
Active in religious affairs p | nnn a4 | =- 2 a 4 
i tore x| 1 | 10 | 186 | 192 | 20 | a 
Executive responsible for work of others SD 72 ** 1 1.00 91 
Selſ· ratings 
1 zii x 223 18 1.56 146 Ls 7322 
al 
Mechanical ability $b T0 81 7 30 71 | 
1 TN . x 2.46 241 240 246 20 | 415 
Understanding of other Sp 8 72 71 w 6 | 
Personality and attitudinal scales | 
ee x| 2308 | zs | 2239 | 180 | 2123 | BS 
Preconscious activity (originality) Sp $23 437 424 | 44 443 | 
-arn A xX 4.08 LETI 4.76 | 338 44s 248 
ore sp) 24 | 20 | 209 | 213 | 196.) 
Interpersonal competency x fae ard 2 | — —— * 


SD. 


Note.—See note of Table 2. 
` All F ratios are significant beyond 0S level. 


The analysis in Table 11 was performed to 
learn if role preferences for students with dif- 
ferent VPI codes conform to theoretical ex- 
pectations (Holland, 1966b, p. 65, Table 7): 
The distributions of student role preferences 
for each VPI code were rank ordered and 
correlated with the expected ranks. The re- 
sults in Table 11 are all positive, but only 
thos of .90 or 1.00 are significant at the 5% 
level for a one-tailed test. In short, students 
with some high-point codes have very pre- 
dictable choices of vocational role, notably, 
Realistic, Social, and Conventional men, 
Social and Conventional women. 

To summarize, the evidence in Tables 2-11 
suggests that the personality types are useful 
models for interpreting and organizing a broad 
range of information about students. Despite 
some undesirable overlapping of character- 
istics across types, the majority of expected 
typological characteristics conformed to the 
theoretical formulations. However, some re- 
visions of the theory are needed for the role 
expectations for each type; for the assignment 
of a few traits to types, and for the applica- 
tion of the theory to women. 


PREDICTION oF VOCATIONAL CHOICE 
AND OTHER CRITERIA 


The theory assumes that Realistic types 
select Realistic occupations and that Intel- 
lectual types select Intellectual occupations, 
etc. The following analyses test this main 
hypothesis with different assessment devices— 
the VPI and the student’s initial expressed 
vocational choice. In addition, several closely 
related hypotheses were examined: the im- 
portance of a student's preference of voca- 
tional role in prediction, the prediction of 
degrees of change in a student’s vocational 
aspiration, and the improvement of predictive 
efficiency by using various combinations of 
predictors or special attributes of the pre- 
dictors. 


Classification Scheme 


For the following analyses and all other 
analyses in this report, student vocational 
choices were categorized according to the 
classification scheme developed earlier: six 
categories for men—Realistic, Intellectual, 
Social, Conventional, Enterprising, and Ar- 
tistic—and eight categories for women—In- 
tellectual, Social-Intellectual, Social-Conven- 
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TABLE 7 
RELATION or THREE-Dicit VPI Copes ro STUDENT CHARACTERISTICS (WOMEN) 


| Three-digit VPI codes 


Student characteristics R 3 5 5 Fa 
SIR SIC SIE SIA 
(N=22) | (N=34) | (N=78) | (N=175) 
| 
Competencies 
Te X| 7.45 6.12 5.41 5.05 3.51 
cule SD 5.29 3.54 3.60 3.16 
AiG X 8.91 297 9.62 10.58 2.63 
AnS SD| 4.07 5.99 5.56 | 5.38 
| 
Personality and attitudinal scales 
Preconsclous activity (orginally) X 16.73 16.74 18.18 19.55 7.23 
SD 4.79 4.13 3.94 4.02 
5 n ` x 4.36 5.06 4.72 4.11 4.11 
8 5 1855 1.76 1.66 1.75 
3 X 12.86 10.41 12.32 11.71 3.77 
Interpersonal competency SD 3.49 2.86 2.84 3.27 
SER SEI SEC SEA 
(W=15) | (N=43) | (N=130) | (N= 173) 
Competencies | 
P x 1,93 3.35 2.09 2.48 4.95 
Scientific 55 157 1.96 1.68 2.09 
2 X 6.93 4.77 4.39 4.49 3.67 
Technical SD| 326 2.55 2.88 2.87 
5 5 X 7.93 8.81 8.06 8.89 3.53 
Social and educational SD 272 247 252 217 
ave X 7.60 9.88 8.54 11.17 7.75 
3 55 3667 4.04 5.05 5.36 
Life goals 
3 X 1.67 1.21 1.00 1.17 7.36 
Developing useful product SD 87 55 31 4 
x e A X 1.47 T23 1.12 1.12 3.50 
Developing scientific theory SD ‘81 50 30 40 
Self- ratings 
j 515 X 1.60 1.93 1.92 1.69 3.15 
Math ability SD 80 709 81 70 
Personality and attitudinal scales | 
$ at; pas ee X 15.40 17.09 15.62 18.20 10.99 
Preconscious activity (originality) SD 3.26 4.51 3.98 3.96 
3 X 5.00 5.14 4.27 4.38 2.70 
e 5 2007 1.80 2.02 1.86 
j t X 11.13 12.53 11.85 12.80 3.29 
Interpersonal compe ency SD 2.75 2.76 307 310 
SAR SAI SAC SAE 
(N=16) | (V=206) | ~ =93) | (V=443) 
Competencies 
NINA X 1.81 3.46 2.76 2.62 9.71 
Stine 5 1.24 2.14 2.29 1.80 
Social and educational a 7.25 7.80 7.98 8.58 7.23 


55 2.63 2.45 2.29 2.19 


A 
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Student characteristics 


Artistic 
Leadership 

Self-ratings 
Originality 
Math ability 

Personality and attitudinal scales 
Preconscious activity (originality) 
Collegiate type 


Interpersonal competency 


— —ͤ . 


Competencies 


Scientific 

Business 

Social and educational 
Leadership 


Life goals 


Executive responsible for work of others 
Self-ratings 
Popularity 
Math ability 
Personality and attitudinal scales 
Preconscious activity (originality) 


Collegiate type 


TABLE 7—Continued 


Three. digit VPI codes 

SAR SA | SAC 
(N=16) | (N=206) | (N93) 
£i 9112 11 11106 
SD| 3% 616 | 618 
X| 387 5.13 494 
SD! (S55 |" SR |e 382 
| 
en 
SD 58 1 67 
x 2.00 1.90 2.04 
5 1.00 83 80 
X 19.87 2081 18.26 
Sp 3.44 411 | 3.71 
ao jeer Fi 4.76 4.80 
SD 1.96 1,96 2.02 
x| 12.25 11.59 11.27 
5327 3.25 3.07 
ASR ASI ASC 
(N=22) | (N=211) | (N=34) 

* 2:59 3.12 2.47 
SD 1.87 2.07 1.61 
X| 2.09 1.86 2.70 
SD 1.24 1.14 1.11 
X 7.23 7.72 8.00 
5D 2.13 2.47 2.49 
2 4.80 4.50 
SD\ 2.92 2.77 3.42 
x 1.55 1.73 2.03 
SD 72 81 71 
295 2.28 2.41 
SD 56 62 65 
x 1.77 201 1.91 
SD 85 88 95 
X 24.00 23.05 20.12 
sp| 4.01 3.97 3.98 
X 4.32 4.50 4.82 
SD 1.79 2.14 2.01 


SAE 


| (Nw 443) 


ASE 


(N= 339) 


; 


Note.—See note of Table 2 


a All F ratios are significant beyond .05 level. 


P 


3.79 


8.51 


4.95 


3.42 


4.26 


4.72 


6.21 
3.08 


19 


381 
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TABLE 8 
RELATION OF SATISFACTION WITH COLLEGE AND PERSONALITY TYPE 


(VPI Hicn-Pornt Copes) ror FALL SAMPLE 


Women (N = 1383)> 


VPI Men (N = 1353)" 
high-point | - | p= 

code Low | Middle High Low | Middle | High 
Real 19.3 (22) | 46.5 (53) 33.3 68) 0.0 (0) 0.0 (0) 100.0 6) 
Int 8.2 (45) 50.5 (277) 41.2 (226) 12.3 (28) 49.8 (113) 374 (85) 
Soc 94 (20) 50.9 (108) 38.7 (82) 12.4 (93) 44.9 (338) 42.6 (321) 
Conv 12.4 (13) 43.8 (46) 43.8 (46) 4.0 (2) 48.0 (24) 48.0 (24) 
Ent 15.0 (29) 53.9 (104) 31.1 (60) 13.9 (5) 61.1 (22) 25.0 (9) 
Art 10.2 (19) 49.5 (92) | 392 03) 17.4 (55) 47.6 (151) 34.7 (110) 


Note.—Chi-square calculated from frequencies shown in parentheses. Abbreviations: Real = Realistic; Int = Intellectual; 


Soc = Social; C 
ax? = 21.97; df = 10; p < .02. 
bx? = 21.27; df = 10; p < .02. 


TABLE 9 


RELATION OF EDUCATIONAL ASPIRATION 10 
PERSONALITY Types (HtcH-Pornt 


VPI Cones) 
Men (N=1328)* Women (N= 1363) b 
VPI 3 
high-point BA or | Advanced | BA or | Advanced 
code l 17 5 kE 
ess training less training 
(%) (%) (%) (%) 
Real 43.8 56.3 100.0 0.0 
Int 17.6 82.4 48.2 51.8 
Soc 29.1 70.9 58.6 41.4 
Cony 47.6 52.4 80.0 20.0 
Ent 31.6 68.4 68.6 31.4 
Art 16.9 83.1 45.9 SST 


Note.—See note of Table 8 for abbreviations. 
* 5.26; df 5; p <.001, 
by? = 38.74; df = 5; p <.001. 


TABLE 10 


RELATION OF COLLEGE GRADES TO PERSONALITY TYPES 
(VPI HicH-PornT Copes) 


onv = Conventional; Ent = Enterprising; Art = Artistic. 


tional, Social-Enterprising, Social-Artistic, 
Conventional, Enterprising, and Artistic (Hol- 
land, 1966a). Tables 12 and 13 indicate the 
assignment of vocational choices to vocational 
classes for men and women. Students selected 
their career choices from a coded list of 99 
careers. All classifications, as well as the 
scoring and establishing the highest VPI scale 
codes, were performed by a computer. We as- 
sumed, then, that the classifications were per- 
formed with perfect reliability. 


TABLE 11 


CORRELATIONS (RANK ORDER) BETWEEN PREDICTED 
AND OBSERVED ROLE PREFERENCES FOR THE 
PERSONALITY TYPES 


Personality types (VPI high-point codes) 


Real | Int | Soc | Conv | Ent | Art 


VPI Men Women 
a a — b 
Kiek pong (V= 1359) (V= 1386) 
code 
<C 2C+ | <C | >C+ 

Realistic 50.9 49.1 33.3 66.7 
Intellectual 43.4 56.6 42.3 ST% 
Social 50.0 50.0 45.3 54.7 
Conventional | 55.2 44.8 52.0 48.0 
Enterprising 53.9 46.1 55.6 44.4 
Artistic 44.1 55.9 44.5 SBS) 


Fall sample (1359 men; 1386 women) 


Men 90“ .60 | 90* | 1.00%" | .50 20 
Women | 75 80 .90* .90* 50 60 


Spring sample (1773 men; 2336 women) 


1.00 | 50 | .18 
1.00** | 68 | .10 


Men 70 60 .90* 
Women | .50 60 | .90* 


ax? = 1141; df = 5; p < .05. 
by? = 3.68; ns. 


* p <.05, one-tailed test. 
* p <.01, one-tailed test. 
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TABLE 12 
PSYCHOLOGICAL CLASSIFICATION SCHEME FOR Vocations (MEX) 


Realistic class 
Agricultural Science Forestry Industrial Engineering 
Architecture Geography Mechanical Engineering 
Civil Engineering Industrial Arts Education Trade & Industrial Education 
Farming 
Intellectual class 
Aeronautical Engineering Engineering; General, Other Oceanography 
Anthropology Engineering Science Other Biological Science Fields 
Astronomy, Astrophysics Geology, Geophysics Other Health Fields 
Biochemistry Mathematics Education Pharmacy 
Biology Mathematics, Statistics Physical Therapy 
Botany Medical Technology Physics 
Chemical Engineering Medicine Physiology 
Chemistry Metallurgical Engineering Veterinary Science 
Dentistry Military Service Zoology 
Electrical Engineering Natural Science Education 
Social class 
Clinical Psychology Experimental & General Psychology Industrial & Personnel Psychology 
Counseling & Guidance Foreign Language Education Physical Education, 
Education, General, & Foreign Service Recreation, & Health 
Other Specialties General Social Sciences Social Work 
Education of Exceptional Children History Sociology — 
Educational Psychology History Education Theology, Religion 


Elementary Education 


Conventional class 


Accounting Business Education Finance 


Enterprising class 


Economics Other Business & Commerce Public Relations 
Law Political Science Purchasing 
Management Public Administration Sales 
Marketing : 
Artistic class 
iti Music Education 
Art General Humanities ) - 
Art Education Journalism, Radio-TV, Other Fine & Applied Arts 
Drama Communication me 
English, Creative Writing Literature peec! 
English Education Music 
Vocational Choice and VPI * earlier is shown in Table 14 for men in the 


fall sample. The percentage of correct predic- 

The prediction of a student’s final voce tions varies from 63.2 to 71.2%. The total 
tional choice from his first choice 8 mo. umber of correct predictions always exceeds 
base-rate expectations and cannot be at- 
tributed to chance. 


3 These analyses were reported earlier as ACT 
Research Report No. 18 (Holland & Lutz, 1967). 


N 
— 
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TABLE 13 


PSYCHOLOGICAL CLASSIFICATION SCHEME FOR VOCATIONS (Women) 


Agricultural Science 
Architecture 
Biochemistry 
Biology 


Clinical Psychology 
Dentistry 
Mathematics Education 


Intellectual class 

Chemistry Other Biological Sciences 
Mathematics, Statistics Physics 
Medicine Veterinary Medicine 
Natural Science Education Zoology 

Social-intellectual class 
Medical Technology Physical Therapy 
Nursing Political Science, Government, 


Experimental & General Psychology Other Health Fields 
Pharmacy 


Social-conventional class 


Business Education 


International Relations 
Theology, Religion 


— ——ñ —fVH—. —-—:' 


Clerical, Office Work Secretarial Science 
Social-enterprising class 
Educational Psychology Purchasing Sales 
Management, Business 
Administration 
Social-artistic class 
& Guidance History Education Physical Education, 
Education, General & Home Economics Recreation, & Health 
Other Specialties Home Economics Education Public Relations, Advertising 
Education of Housewife Social Science 
Exceptional Children Law Social Work, Group Work 
Elementary Education Modern Foreign Language Sociology 
English Education Education Speech 
History 
Conventional class 
Accounting 
Enterprising class 
Marketing 
Artistic class 
Art Journalism, Radio-TV, Modern Foreign Language 
Art Education Communication Music 
Drama Library Science, Archival Science Music Education 
English, Creative Writing Literature Other Fine & Applied Arts 
Foreign Service 


Philosophy 
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TABLE 14 


Premicnox or Fixar Vocanoxat Cuoice rron Fist Vocamosat Ce 
(Fart Saure, Mex, N = 1599) 
= = — Z— — 


e S re 1 1 
| Real | Int | Soe | Conv | Est | An | ver | |n f N 


— — — at 
| — — 
106 13 4 1 n "n 69 é | 1% 
31 355 2¹ 14 Ld * * 61.6 10 gs 
| 3 8 110 i n J 19 oss +4 | @ 
1 2 42 7 7 712 » 
| 5 15 10 155 4 n 67.1 6 21 
1 2 8 “ 10 632 1 70 
4 14 13 4 18 4 SS 1 111 
| 10 8 5 8 9 1 21 
>ON — E T — 
are a commervative 


—— of bit in Table 14 and succerding tables the total N in each row aed 
imate, because — — In ure “undecided” or give ding tables e survey ‘are inchaded! te the total V. 


The prediction of a student’s final voca- a subsample of 
mal choice from his highest score (high- tional choices, at the time of initial testing, 
code) among six VPI scores, ob! 
earlier, is given in Table 15. The per- physics and chemistry, education 
tage of correct predictions ranges only work, art and ti 
tom 21.5 to 51.4%. In this instance, it is 
that simply asking the student is su- follows: 

etior to using the VPI, although both meth- Er — 
of assessing personality types have pre- ee sae Da one * panies’ 
tive validity. 

The discrepancy between the efficiency of HP) 
a student’s expressed choice and his VPI would 

ores becomes even greater when we select appropriate number from this list) 


PREDICTION OF FINAL 
(Part. Samete, MEN, N = 1359) 


——— 
Final vocational choice 


ence the 
VPI and expressed based on 
1 Ne ti predictive e 
nd = undecided. 

NR = no response. 
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TABLE 16 


PREDICTION or FINAL VOCATIONAL 


CHOICE FOR STUDENTS WHOsE First 


Two Cuorces FALL IN THE SAME CLASS 
(FALL SAMPLE, MEN, N = 586) 


Final vocational choice 


First and second 


T 
varion eoo | Real | Int Soc Cony Ent Art Und % NR N 
| hits 

Realistic (Real) | 36 2 1 6 1 76.6 1 47 
Intellectual (Int) 3 239 5 4 10 4 21 78.1 5 306 
Social (Soc) | 47 3 1 9 78.3 60 
Conventional (Conv) 12 2 85.7 14 
Enterprising (Ent) 1 2 2 69 2 9 78.4 3 88 
Artistic (Art) 3 3 22 1 73.3 30 
Undecided (Und) 4 1 2 1 5 13 
No response (NR) 9 3 Kj 6 7 28 


Table 16 reveals that subgrouping students 
whose first two choices belong to the same 
vocational class results in a substantial gain 
in predictive efficiency. Correct predictions 
range from 73.3 to 85.7% as contrasted with 
63.2 to 71.2 for the total sample of men. The 
subgroup of students whose first two choices 
fall in different vocational classes are, as ex- 
pected, less predictable from their first choice 
than students whose first two choices belong 
to the same class. Although the percentages 
of hits are lower—53.0 to 66.7—these per- 
centages still exceed the percentages obtained 
by the use of the VPI—21.5 to 51.4. 

The results for women in the fall sample 


are presented in Tables 17, 18, and 19. The 


main findings for women parallel those for 
men. The analyses for women are identical 
to those for men except for the use of a 
special classification scheme developed for 
women in which the social class is divided 
into four subclasses. This subdivision was 
made to distribute women more equally 
among the classes or types. Expressed voca- 
tional choices predict 34.8-83.8% of later 
choices (Table 17). 

The VPI in Table 18 predicts final choices, 
on the average, less efficiently than asking the 
student (0.0-81.9%). In Table 19, the for- 
mation of a subsample of women whose first 


TABLE 17 


PREDICTION or FINAL VOCATIONAL CHOICE 


FROM FIRST VOCATIONAL CHOICE 


(FALL SAMPLE, Women, N = 1386) 


Final vocational choice 
First vocational choice 5 
oc | Soc | Soc | Soc 7 % 2 | Total 
uae Int | Conv | Ent | Art Cony Ent Art Und hits NK 

Intellectual (Int) 54 19 2 2 18 2 A 15 42.5 11 | 127 
Social Intellectual (Soc Int) 10 11733 3 6 42 1 1 10 7 66.3 13 } 276 
Social Conventional (Soc Conv) 17 1 1 373.90 123 
Social Enterprising (Soc Ent) 2 8 10 1 1 34.8 1| 23 
Social Artistic (Soc Art) e ee E | 25 | 37 |83.8| 12 | 660 
Conventional (Conv) 2 1 1 55.6 2 
Artistic (Art) Tel gegen? 9 34 73 10 52.10 8 | 140 
Undecided (Und) 28 11 | 24 12 85 
No response (NR) 5 4 16 7 9| 8 
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TABLE 18 


PREDICTION or FINAL Vocanoxar Cuoice rrom VPI Han- Por Cone 
(Farr Saupre, Women, N = 1386) 


Final vocational choice 


VPI high-point code | 


| soe | Soc | Soe 8d Cen E lg % n 
| Int | Int | Conv | Ent | Art | Conv | Ent Art | Und* hits NR* | Total 
. See | | : | | 
Realistic (Real) | 2 1 00 3 
Intellectual (Int) 46 6t 4 2 55 1 17 25 20.3 13 227 
Social (Soc) 235 „ 
Conventional (Conv) | ß A 35 |100| 4] 9 
Enterprising (Ent) 7 3 16 1 21) a 00 2 30 
Artistic (Art) 12 43 4 7 139 1 1 65 | 23 20.5 | 22 | 317 
| 
»Und = undecided. 
NR = no response. 
TABLE 19 
PREDICTION OF FINAL VOCATIONAL CHOICE FOR Srupents Wuosx First 
Two Cnoicxs FALL IN THE SAME Crass 
(FALL SAMPLE, WOMEN, N = 545) 
Final vocational choice 
First and second = 
vocational choice Soc | Soc | Soc | Soc % W „ 
: ! d NR t 
Int | Int |Conv| Ent | Art Conv | Art | Und | bits ye 
— : | f — 
Intellectual (Int) 25 7 1 1 | 3 | 658) 1 38 
Social Intellectual (Soc Int) 2 85 2 2 5 1 1 | 850} 2 100 
Social Conventional (Soc Conv) 8 1 1 | 727 1 11 
Social Enterprising (Soc Ent) 1 1 50.0 2 
Social Artistic (Soc Art) Mee oe a 299 8 | 13 |885| 7 ian 
Conventional (Conv) 1 
Artistic (Art) 1 1 8 23 2 |5| 3 38 
Undecided (Und) A 1 : 
No response (NR) 1 3 > 1 5 1 16 
TABLE 20 


two choices belong to the same class yields 
the most efficient predictions—S0.0 to 88.5%. 
Again Table 19 demonstrates that students 
whose first two choices belong to the same 
vocational class are more predictable than 
students whose choices belong to different 
classes (32.6-78.9%). (The data for the 
latter analysis are not presented.) 

Because of the controversial character of 
the results obtained for the fall sample, the 
same analyses were performed for the spring 
sample with similar results. These analyses 
involve a longer interval of time (1 yr. in- 
stead of 8 mo.) and a different stage of col- 


SUMMARY FOR SPRING SAMPLE 
—— — 
% correct predictions 


— SS — 


Men Women 
(N = 1773) | (N = 2336) 


Kinds of prediction 


— 


Expressed vocational 

choice total 68.7 78.2 
VPI—highest scale 45.1 59.6 
Expressed yocational 

choice—same 82.5 86.4 
Expressed vocational 

choice—different 64.2 719 


—̃ — —-¼-— — 


26 Joux L. Horraxo 


lege life (end of freshman year to end of Without exception, the results in Table 20 
sophomore year, as opposed to beginning of replicate what we found earlier—various cate- 
freshman year to end of freshman year). gorizations of expressed vocational choice are 
Table 20 is a summary of the main results. clearly and substantially superior to the VPI, 


TABLE 21 


COMPARISON ov THE Peeoicrive Varves or Hian- Pater VPI Cone Ab Prererrep Rove, ann 
Inrrtat Vocationat Cnorcr Nh Prererrep Rote (FALL SAMPLE) 


Preferred vocational role 
U 8 | Practitioner Teacher Leader Researcher | Consultant 
| 
1 .% hits s hits | J %hits | hits , c hits / e bits 
Men (N = 1359) 
VPI 
Real 45 222) 1 4.1) 6 50011 54.5 9 33.3 9 556 
Int 155 36.1 154 57.8 35 45.7 23 30.4 | 160 66.5 22 36.4 
Soc 73 27.4] 77 3% 27 484113 462 8 37.8 14 429 
Conv 42 21.4 32 344] 3 65.7 11 9.1 4 250/13 602 
Ent @ 476) © 536] 5 0.0 30 633 5 600) 21 42.9 
Art 71 16.9] 65 22) 33.3 20 150] 13 Ti 8 250 
Total 30.5 46.9 43.5 38.9 58.8 H8 
Vocational choice 
Real 30 61.8 60 71.7) 7 57.116 75.0 | 21 57.1 | 13 84.6 
Int 129 874 171 71337 649 22 409140 765117 706 
Soc SO S28) 57 77228 «875119 S79) 13 462 | 11 81.8 
Conv 2 50.0 17 82.4] 1 1000 2 00} 2 1000 17 882 
Ent 66 89.187 724 4 75.0 43 767 7 429 24 583 
Art 22 50.0 34 64.7 11 72.7 4 1000 3 606.7 2 300 
Total 56.7 71.5 72.6 65.1 71.3 73.8 
Women (N = 1386) 
VPI 
Real 1 2 90 
Int 15.4 760 19. 17 00 12 s 50.0 
Soc 205 74.1347 87¼ 80 76.3 57 930 21 656.7 43 8144 
Conv 18 5.6 4 00 5 0.0 3 0.6 3 00 
Ent 10 00] 13 00} 2 00 7 0.0 3 0.0 | 1 0.0 
Art 88 22.7 143 109.6 37 27.0 18 16.7 11 0.0 | 20 200 
Total 48.2 57.8 50.7 57.6 34.8 57.5 
Vocational choice 
Int 34 20.6 38 63.2 5 200] 1 1000 44 409 5 60.0 
Soc Int 62 61.3 129 74.4 26 46.2 29 558.2 24 75,0 6 300 
Soc Conv 4 500| 13 759 4 1000 2 5000 
Soc Ent S 00 4 75.0 4 25.0 
Soc Art 141 75.9 75 77.357 86.0 16 62.5 37 865 
Conv 57.1 1 100.0 1 0.0 
Ent 
Art 30 35.9 59 62.7 21 52.4 5 800 2 0.0 14 50.0 
Total 50.7 20.7 64.2 75.3 53.5 69.1 


Note.—The Ns in this table represent all HE data. The Ns for totals and subtotals differ from the total sample Ns and are 
‘ad tied VPI profiles, gave unclassifiable choices, or failed to respond have been 


not internally consistent because students who h: 


omitted. The italicized percentages indicate the "expected" vocational role for each class or type. 


Undecided. Other, No Response, 


n 


Vocational Choice, VPI, and Role 


The following analyses (see Footnote 3) 
were performed to see how well we could 
| a student’s later vocational choice 
when his preferred vocational role was con- 
sidered along with his first vocational choice 
or his VPI scores, Because special vocational 
roles are associated with each environment 
(Holland, 1966, pp. 64-65), and by implica- 
tion with each type, we that more 
efficient predictions would be obtained for 
students whose personality type and prefer- 
ence for vocational role were congruent. For 
example, an Intellectual type who prefers the 
role of researcher should be more 

than the same type who prefers the role of 
supervisor. 

Table 21, a summary of 24 tables, presents 
the predictive efficiencies of a student's high- 
est VPI score and his role preference, and his 
i vocational choice and his role pref- 
erence. For men, students who prefer the roles 
of practitioner, teacher, leader, researcher, or 
consultant are more likely to give the same 
or closely related vocational choice 8 mo. 
later than are students who fail to respond, 
or who respond “undecided” or “other role.” 
Or about two-thirds of the re- 
veal that male students with explicit role 
preferences are more stable than students 
with ambiguous preferences. The results for 
women in Table 21 imply that vocational 
role adds little or nothing to a woman’s initial 
expression of vocational choice. 

The italicized percentages within the six 
classes indicate the “expected” vocational role 
for each class or type (Holland, 1966b, p. 
65). The efficiency of these theoretically de- 
rived predictions is unclear because many 
cells in Table 21 are too small to yield stable 
percentages. If the “total” percentages in 
Table 21 are, however, used as à base, the 
“expected” percentages of hits, for men, ex- 
ceed the corresponding 
out of 10 times! For instance, 
peaks on Social and preferences 
teacher have a hit rate of 48.1% whereas the 
hit rate for “teacher” generally is 43.5. The 
results for women are again ambiguous for 
Several reasons. Both the VPI and the clas- 
sification scheme create many small cells, and 


Turoxy or VOCATIONAL Chois 


* 


TABLE N 


Resatios on Pessomasery Teves (ieee Poss VFI 
Cogs) vo Stasssty or Vocarenat C 
(Pass Sarta 


High puit VPI onie 


Fi ont 
= — 
$ Reij toe | Se Come | Baa | An | Te 
Mee CY ~ 10) 
o chanar | sas ses orol oes | eae | tha) ste 
strachem pHo me iaol Mapi 1 pe 
marciss = | ANE |836 420) OTF yo ma) oe 
Wess O = bite) 
Wocap | 33.2 ene See 2 t 
t ol ési? 6o | 9) Be uns 
intere tase en | aha | a7 mo BAS J63 | oe 
1 


rr 
byt = 4007, df = eee. 


it is unclear how to derive a single set of 
explicit predictions from the vocational-choice 
classification for women. 

The efficiency of the predictions obtained 
for expressed choice and vocational role ap- 
proximates those obtained by subgrouping 
whose first two choices belong to the 


his vocational intentions and then 
for his preferred vocational role. Either 
these methods is almost twice as efficient 
the VPI. 


Degrees of Change 


In Table 22, the relation of personality type 
VPI code) to 


classes. 

Table 22 reveals that, for women, some per- 
sonality types are more stable than others— 
notably the Social and Conventional types. 
The results for men are not significant and 


28 Joux L. HoLLAND 


fail to support earlier findings for high- apti- 
tude students (Holland, 1962). 


Personality Patterns 


The theory assumes that the consistency 
and the homogeneity of a student’s VPI pat- 
tern, or personality pattern, are indicative of 
the stability of his initial vocational choice 
(Holland, 1966b, p. 43f, p. 76f); that is, 
consistent patterns should be more stable 
than inconsistent patterns, and homogeneous 
profiles should be more stable than heteroge- 
neous profiles, 

Tests of the consistency hypothesis are 
shown in Tables 23 and 24 for men in the 
fall and spring samples. In 11 out of 12 com- 
parisons, the percentages of hits for students 
with the same VPI code favor the students 
with consistent codes. Consistent Realistic 
types are more stable than inconsistent Re- 
alistic types, etc. However, only 4 out of the 
12 comparisons are significant at the 5% 
level (one-tailed test). A new test with a 
larger N is needed to clarify these results. 


The results for women are not only not sig- 
nificant but not worth reexamination in the 
context of this theory. 

In Table 25, students were categorized as 
having VPI profiles of high, middle, or low 
homogeneity (the difference between the high- 
est and lowest VPI raw score), then compared 
for their stability of vocational choice. For 
men, the results clearly support the hypothesis 
—homogeneity is positively related to sta- 
bility, or, spiked profiles are stable and flat 
profiles are unstable ( < .05). The results 
for women are not significant. 

Another test of the homogeneity hypothesis 
consisted of examining the stability of voca- 
tional choice for students whose VPI profiles 
had (1) “no ties”—a single scale was higher 
than any other scale, (2) the two highest 
scales were “tied”—both had equal scores, or 
(3) the three or more highest scales were 
tied. The results of this analysis, shown in 
Table 26, were statistically significant and 
according to expectation for men. Again, the 
results for women are not significant. 


TABLE 23 


CONSISTENCY OF THE PERSONALITY PATTERN (Two-Dicrr VPI CODE) AND STABILITY OF 
VOCATIONAL CHOICE ATER 1 YEAR (FALL SAMPLE, MEN, N = 1359) 


Final vocational choice 


VPI 
high-point code % 
Real | Int Soc | Cony Ent Art | Unda hits | NR | Total 
1 
Consistent two-digit VPI codes 
Realistic (Real) 34 22 4 7 4 10 40.9 2 83 
Intellectual (Int) 55 182 12 3 22 9 38 54,3¢ 14 335 
Social (Soc) 3 13 49 9 22 9 23 37.7 2 130 
Conventional (Cony) 8 11 4 30 22 9 34.9 2 86 
Enterprising (Ent) 6 9 15 16 76 2 26 49.4 4 154 
Artistic (Art) 6 28 27 35 38 32 22.6 2 168 
Inconsistent two-digit VPI codes 
Realistic 9 6 5 4 7 29.0 31 
Intellectual 15 100 33 6 21 5 32 46.7¢ 2 214 
Social 5 19 29 12 2 14 35.4 1 82 
Conventional 3 8 ki 1 4 15.8 19 
Enterprising 3 7 2 2 22 3 56.4 39 
Artistic 2 3 1 1 6 2 3 11.1 18 


a Und = undecided. 
NR = no response. 
ex? = 3,00; df =1; p <.05, one-tailed test, 


i 
7 
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TABLE 24 
t CONSISTENCY OF THE PERSONALITY Patrern (Two-Dicrt VPI Cope) axo STAmiLITY OF 
2 VOCATIONAL CHOICE AFTER | Year (Sratmnc Saurtx, MEN, N = 1773) 
| Final vocational choice 
VPI high-point code | ie i j 7 
Real | Int | Soc | Conv | ent | art | und | ,% | NR | Total 
. | = | | 
Consistent two-digit codes 
Realistic (Real) 71 57 15 11 21 1 15 36.8" 2 193 
Intellectual (Int) 62 245 33 6 25 19 23 58.2 8 421 
Social (Soc) 6 18 84 2 24 14 13 | S515 2 163 
Conventional (Cony) g 5 6 H 35 1 2 | 36.6 2 93 
Enterprising (Ent) 11 15 24 15 78 8 8 48.85% 1 160 
Artistic (Art) 5 28 36 1 28 59 18 33.0 4 179 
| — — — — 
| Inconsistent two-digit VPI codes 
n — — 

A Realistic 26 16 25 2 12 1 | 18 25.85% 2 102 
Intellectual 13 115 38 7 29 7 | 14 81.10 2 225 
Social 15 16 67 1 22 5 10 47.5 5 141 
Conventional 4 6 4 4 1 | 2 19.0 21 
Enterprising 14 12 8 3 15 2 + 25.9 58 
Artistic 1 8 2 1 2 2 | 1 11.8 17 

ay? = 3.82; df =1;p < .05, one-tailed test. 
by? = 2.99; df =1;P < 05, one-tailed test. 
ex? = 9,05; df = 1; p < 01, one-tailed test. 
Although this second analysis did not use Combinations oj Consistency, Homogeneity, 
the standard definition of homogeneity—the and Role 
difference between highest and lowest VPI It follows from the theory that students 
scale scores—it does suggest that the relative hose VPI profiles are consistent and homoge- 
sharpness of a profile, as assessed by the crude neous, and who express a vocational role 
process of counting the number of tied high preference other than “undecided, no re- 
points in a profile, is indicative of a student’s sponse, or other role” will be more predictable 
stability of vocational choice. Taken together, than students who have inconsistent and 
the analyses in Tables 25 and 26 imply that heterogeneous profiles, and who fail to ex- 
the construct, homogeneity, has some validity press a role preference. To learn if the effects 
for men, but not for women. 
TABLE 26 
TABLE 25 HOMOGENEITY OF A STUDENT'S VPI PROFILE AND 
HOMOGENEITY OF THE PERSONALITY PATTERN (VPI Sranmrry or His VOCATIONAL CHOICE 
PROFILE) AND STABILITY OF VOCATIONAL (FALL SAMPLE) 
CHorce (FALL SAMPLE) 
Men (N = 1402)» Women (N = 1427)” 
Men (N = 1359) [Women (N = 1386) G8 5. 
Hong geneity Hits Misses Hits Misses 
geneity Hits Misses Hits Misses 
f % e % Ff | % 
Flee eet |e — No ties 816 | 67.6 | 391 | 32.4 | 893 | 71.0 | 365 | 29.0 
3— iltenro aan (470 i 3 53 80|66.7| 40 33.3 
High 201 | 46.4 | 232 | 53:6 | 248 | $29 | 320 3 G oE aore 40 $2.6 36 474 34 094 | 1530.5 
ate 168 3728 217 62.2 205 40. | 212 504 Total 933 469 1007 420 


ay? = 6.81; df = 2; 5 < 105; 
by? = 3.31; ns. 


ay? = 7.43; df = 2; p <05. 
by? = 1.04; ns. 
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TABLE 27 


PREDICTION OF FINAL VOCATIONAL CHOICE FROM STUDENT'S VPI PROFILE AND ROLE PREFERENCE 


| 


Samples (males only) 


VPI profile U - = 

i 1 Biers Fall Spring Fall-Inte 
Consistency | Homogeneity | | % hits | f % hits f % hits | f 
C +f. ‚ Er e L: | 305 53.4 509 68.2 107 
7 ela a eE 
E 4 75 Pie E ci 30 1060 37.3 233 475 50 
I + Lo + Explicit = 45.0 $ 5 56 
C + Hi + Ambiguous = 33.3 126 30.5 124 46.4 13 
I + Hi + Ambiguous = 26.2 42 32.1 56 30.0 10 
C + Lo + Ambiguous = 31.8 195 30.1 156 27.1 13 
1 a Lo + Ambiguous = 25.6 | 86 28.9 90 19.4 6 


a Consistency, 
b Inconsistency. 
¢ Fall-Intellectual. 


of consistency, homogeneity, and vocational 
role are cumulative so that their occurrence 
in combination results in more efficient pre- 
dictions, all possible combinations of these 
variables were tried. Table 27 presents the 
results of these analyses for men in the fall 
and spring samples. 

The results suggest that consistency, homo- 
geneity, and vocational role preference cumu- 
late to improve the prediction of vocational 
choice, but their relative contributions appear 
very unequal. Explicit role preference ap- 
pears to contribute most to the prediction, 
followed next by homogeneity, and last by 
consistency. 

The predictions in Table 27 can be im- 
proved by controlling for VPI high-point 
code, that is, by recalculating the analyses in 
Table 27 for a single high-point code at a 
time. These analyses were done, but they re- 
quire large Ws to insure reliable percentages. 
For the largest subsample—male students who 
peaked on the Intellectual scale of the VPI— 
the results are especially clear and are shown 
in the last column of Table 27. Similar 
analyses for the smaller remaining subsamples 
produced similar but less-consistent results. 
(It is instructive to note that the best predic- 
tive combinations for men and women in 
Table 27 are either equal to, or less than, 
the predictive validity of a student’s first 
vocational choice in Tables 14 and 17.) Aside 


from their theoretical value, the results in 
Table 27 reinforce some common interpreta- 
tions about vocational counseling clients; 
namely, the row at the top of the table 
characterizes the people vocational counselors 
tarely see, whereas the bottom row charac- 
terizes the kinds of people counseling agencies 
usually deal with; that is, clients who pro- 
duce confused interest profiles and who are 
unclear or uncertain about the vocational 
role they want to play. 

To summarize, the predictive data in 
Tables 14 through 27 are strong evidence that 
the types and their associated constructs— 
vocational role preferences, consistency, and 
homogeneity—generally yield efficient and 
theoretically relevant predictions for men but 
only occasionally for women. At this point, 
the efficiency of self-expression of vocational 
choice suggests that a more thorough explora- 
tion of the general usefulness of this tech- 
nique for defining types is desirable. Simi- 
larly, a redefinition of consistency could be 
achieved by the use of a person’s first and 
second vocational choice, and this new defini- 
tion appears now to have more potential than 
the definition of consistency by the VPI. 


INSTITUTIONAL INFLUENCES 


The following analyses were performed to 
learn if a college's influence upon a student's 
choice of vocation could be predicted from 
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the theory by characterizing students and col- 
leges in terms of personality types and en- 
vironmental models, and then testing the ap- 
propriate hypotheses (Holland, 1966b, p. 
72f). More explicitly, the analyses are con- 
cerned with the stability of vocational choice, 
or whether or not a student switches from one 
vocational class to another. A second group 
of analyses was performed to learn if a stu- 
dent’s satisfaction with his college is pre- 
dictable within the context of the theory. 


Assessment of Types and Environments 


In these analyses, a student’s initial voca- 
tional choice was used as an index of his 
personality type (see Tables 12 and 13) and 
as a control for student input. The student’s 
initial vocational preference is assumed to be 
a powerful method for equating students in 
the same vocational class across colleges: 
first, an earlier study (Astin, 1965) revealed 
that a college freshman’s vocational choice 
and initial major field were the best predictors 
of his vocational choice at graduation and 
that other variables added only .01-.03 to 
the multiple correlation. Second, several anal- 
yses were performed to examine the adequacy 
of initial vocational choice as a control varia- 
ble. For this purpose, average high school 
grades and family income were related to 
initial and final vocational choices in the fall 
sample for men and women. The median tests 
(2 X 6 chi-squares) revealed that a student’s 
family income and average high school grades 
were significantly related to both his first and 
last vocational choice. And, for women, high 
school grades were related more to first than 
last vocational choice. For men, the converse 
was true, but the difference was trivial (x? 
= 16.5 versus 19.7). For women, family in- 
come was related more to first than last voca- 
tional choice; for men, income was related 
more to last than first choice, but when the 
chi-squares were converted to contingency 
coefficients (C = .14 versus C = .18) only a 
slight difference was obtained. 

Taken together, these analyses suggest that 
a student’s initial vocational choice is a sub- 
stantial control that usually incorporates the 
variance contributed by control variables such 
as grades and income. In addition, initial 
vocational choice incorporates and surpasses 


the contribution that the VPI makes to pre- 
diction (see earlier section on prediction). 

The institutional environment for each col- 
lege was characterized by the Environmental 
Assessment Technique (EAT) (Astin & Hol- 
land, 1961). The assignment of major fields 
to the six environmental models (Realistic, 
Intellectual, Social, Conventional, Enterpris- 
ing, and Artistic) was made according to the 
revised criteria (Holland, 1966b, pp. 122- 
124). This particular version of the EAT is 
only slightly different from earlier versions. 
The source of the basic data was Wright 
(1966). 

The computation of EAT scores was as 
follows: For each college, the number of de- 
grees earned in each field were assigned, by 
the criteria indicated earlier, to one of the 
six environmental models. The absolute num- 
bers for each environmental model were con- 
verted to percentages of the total graduating 
population for that college. The resulting set 
of six percentages forms the environmental 
profile for a college. For example, a college 
of 200 graduating seniors might have the 
following profile: 


Type Number EAT score 
(percent) 
Realistic 20 10 
Intellectual 8 4 
Social 12 6 
Conventional 128 64 
Enterprising 28 14 
Artistic 4 2 
Total 200 100 


Stability of Vocational Choice 


Several analyses were performed to test 
the hypothesis that a student is more likely 
to maintain his vocational choice if he lives 
in a “congruent” environment—that is, Real- 
istic types will be more stable in Realistic col- 
lege environments; Intellectual types in In- 
tellectual environments, etc. (Holland, 1966b, 
p. 73f). First, for each sex at each college, 
6x6 tables, based on the classification 
scheme summarized earlier, were formed to 
show the relation of first to last vocational 
choice for time intervals of 8 or 12 mo. for 
the fall and spring samples. Second, per- 
centages of stability for each vocational class 
at each college were computed. Third, a total 
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TABLE 28 
CORRELATION (RHO) BETWEEN THE SHAPE OF THE 
CoLLEGE Prorite (EAT Cope) AND THe Sranturry 
or STUDENT VOCATIONAL CHOICES IN THE 
CORRESPONDING AREAS OF CHOICE 


Spring sample 


College Men Women 
A 03 35 
B 11 | —.35 
E 4 1,00 
D 64 | — 

E 16 1.00* 
F 14 1,00° 
G 20 | 1.00 
H Al —.50 
I —.47 —.50 
J 94% 80 
K —.07 80 
L | 21 50 
M 24 88 
N — 84 80 
0 47 1.00 
P —.03 — 40 
0 16 1.00 
R 14 95 
8 —.03 80 
oy 26 80 
U — 1.00 
v — | 13 
Fall sample 

College | Men Women 
A | 71 85 
B 03 65 
C .09 — 

D .20 1.00* 
E 20 80 


Note.—The significance levels for women vary because the 
number of vocational classes at individual colleges varies from 
two to four. For men, the number of vocational classes always 
equals six so that all rhos are based on an N of six. 


p< 01, 


percentage of stability was calculated for 
each college, that is, the percentage of ail 
students who reported either identical choices 
or choices that belonged to the same voca- 
tional class on both occasions (no changers 
plus intraclass changers). Fourth, a college’s 
EAT scores were ranked from one through 
six or from highest to lowest percentage of 
graduating seniors in each of six areas of 
study. Finally, the percentages of stability 


for the six personality types and the six EAT 
scores were intercorrelated (rho) for each col. 
lege. The results of these analyses are shown 
in Table 28; a positive correlation means that 
high percentages of stability for a given 
type are associated with high percentages 
of students graduating in the same area. Con- 
versely, low percentages of stability are as- 
sociated with low percentages of students 
graduating in the same area. Put another way, 
the positive correlations suggest that a stu- 
dent is more likely to maintain his present 
vocational choice if he attends a college in 
which his choice is congruent with the most 
popular fields defined according to the present 
six-category scheme. In contrast, a student is 
more likely to change fields if he attends a 
college in which his choice is atypical or un- 
popular. 

The use of a scale of vocational stability 
(six percentages) and a scale of environments 
(six percentages) instead of a dichotomy of 
congruent and incongruent interactions is a 
comprehensive and severe test of the con- 
gruency hypothesis. Many percentages were 
based on small Ns and are markedly unre- 
liable; the expectation that these two sets of 
percentages will be ordered in identical fashion 
is indeed a grandiose hypothesis. 

In Table 28, only 5 out of the 50 correla- 
tions are significant beyond the 5% level, but 
41 out of the 50 are positive, an outcome 
that is statistically significant by sign test 
and congruent with the hypothesis. The mag- 
nitude of the relationship, however, is am- 
biguous, for the rhos are based on Ns of only 
six for men and two to four for women. At 
best, the results imply that the congruency 
hypothesis has some tenuous validity. 

To secure a more explicit test of the con- 
gruency hypothesis, we correlated the per- 
centages of vocational stability for each type 
with the rank of the corresponding environ- 
mental model (EAT variable) across col- 
leges. (The rank of a given EAT variable 
within a college was reranked across colleges.) 
Because the between-college analyses in Table 
29 are concerned with only a single pair of 
student-college variables, they are more easily 
interpreted than the within-college analyses of 
Table 28, in which each correlation involves 
a simultaneous test of six pairs of student- 
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college variables. Although only 4 out of the 

20 correlations (rho) in Table 29 are sig- 

nificant, all 4 are positive in direction. Taken 

together, all statistically significant findings 

in Tables 28 and 29 support the congruency 

hypothesis: students maintain their voca- 

tional choices when their college tends to be 

minated by students whose vocational 

pices belong to the same gı class. 

| To test the hypothesis that homogeneous 

environments are “conducive to vocational 

stability” (Holland, 1966b, p. 67), the per- 

centage of stability for all students at a col- 

lege and the homogeneity of a college’s en- 
vironment (the percentage difference between 
the largest and smallest EAT scores) for sam- 
ples of males and females in the spring and 
fall samples were intercorrelated. The cor- 
relations (product-moment) in Table 30 are 
all positive and support the hypothesis, but 
only one out of the four correlations is sta- 
tistically significant. The results imply that 
students are more apt to maintain their orig- 
al choice when they attend colleges with 
well-defined or spiked profiles, and that stu- 
denis are more apt to change their vocational 
piration when they attend colleges with flat 
profiles or colleges with about equal numbers 
students in the six vocational classes de- 
ed earlier. 

The results in Table 30 can also be ex- 
ained as simply the outcome that would be 
pected from common knowledge about the 
of an educational institution; that is, a 
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student will tend to remain in his field of 
initial choice in a small college because there 
are fewer alternatives, and he will tend to 
change field in a large college because there 
are many alternatives. To test the validity of 
this rival hypothesis, stability of vocational 
choice at an institution was correlated with 
its size. These results are also shown in 
Table 30, and they imply that size is a less 
plausible hypothesis than homogeneity is. The 
single signiñcant correlation supports the size 
hypothesis, but the remaining three correla- 
tions are not signiñcant and in the wrong 
direction. In this regard, it should be noted 
that “homogeneity” is a complex construct 
that incorporates size as well as variability 
among environmental variables (EAT); that 
is, large and diverse institutions are apt to 
be “heterogeneous” while small institutions 
are apt to be “homogeneous.” 
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Satisfaction with College 


To test the hypothesis that a student’s 
satisfaction with his college will be greater if 
his type is congruent with his college environ- 
ment (Holland, 1966b, p. 73), initial and 
final mean satisfaction scores for students of 
different types were correlated with the ranks 
of the environmental models of a college. 
More explicitly, initial and final mean satis- 
faction scores were computed for the six types 
at each college. Then the means for the six ini- 
tial and final satisfaction scores were ranked 


one through six. Last, ranks for initial and 
final satisfaction were correlated with the | 
ranks for a college’s EAT (one through six), 
The rhos for these analyses are presented in 
Table 31. The results for men clearly negate 
the hypothesis—satisfaction with one’s col- 
lege is associated as much with incongruency 
of personality types and college environment 
as it is with congruency. 

The results for women appear to support 
the hypothesis—satisfaction with one’s col- 
lege increases if one’s personality type tends 
to be congruent with the more popular areas 
of study at the college. For women, 17 out of 
25 pairs of correlations for initial and final 
satisfaction increase over time—trends that 
suggest congruency of student and college 
lead to greater satisfaction. Reliance upon 
the increase in correlation between initial and 
final satisfaction serves as a useful control 
for initial level of satisfaction. 

In the final analysis, the average final satis- 
faction score for all students at a college was 
correlated with the homogeneity score for 
their college environment. Other things being 
equal, the results in Table 32 support the 
hypothesis that students are more likely to be 
satisfied in colleges characterized by their 
flat profiles—about equal percentages of per- 
sonality types. Conversely, students are likely 
to be less satisfied in colleges with a well- 
defined or homogeneous student body. These 
results fit the theory that a student is more 
likely to find friends if he attends a large, 
diverse institution (where there will be many 
students of diverse types) than if he attends 
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CORRELATION (PRODUCT-MOMENT) OF HOMOGENEITY 
AND SIZE OF COLLEGE WITH 
STUDENT SATISFACTION 


Spring sample Fall sample 
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small, selective institution (where there will 
be fewer students of only a few types). 

The correlation between institutional size 
and satisfaction with college is also shown in 
Table 32. In general, size appears to be a 
slightly less reliable predictor of satisfaction 
than homogeneity. 

The findings in Tables 31 and 32 imply that 
a student’s satisfaction with his college fol- 
lows the following theoretical rules: 


1. Within the hierarchy of six types of stu- 
dents at a college, the higher a student's 
choice stands in that hierarchy, the greater 
the likelihood of satisfaction, This is simply a 
technical way of saying that students are 
happier the more closely they resemble the 
majority of students. 

2. Students are more likely to find satisfac- 
tion in colleges that possess about equal num- 
bers of each personality type. The assumption 
here is that heterogeneous colleges have a 
diverse student population so that a Student's 
opportunity for finding congenial friends is 
greater than it would be at a college populated 
largely by a single kind of personality type. 

3. The most satisfied student should be a 
student who resembles the most popular type 
at his college and whose college has a hetero- 
geneous profile of types—the distribution of 
types is about equal. 

4. The most dissatisfied student should be 
a student who resembles the most unpopular 
type at his college and whose college has a 
homogeneous distribution of types—the dis- 
“tribution of types is very unequal so that a 
single type dominates the college with only 
weak representation of the remaining types. 
Thus, his chances for finding congenial friends 
are less than they would he in a heterogeneous 
student body. 


Taken together, the applications of the the- 
ory to student-college interactions in Tables 
28-32 were a partial success. These analyses, 
however, clearly reveal the need for replica- 
ti colleges and for 
aore complete student samples at individual 
colleges. Further, the findings that contradict 
theory require clarification. 
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findings to more diverse populations 
although the typical college student is an 
provement over the typical Nat 
finalist—the source of data for 
earlier investigations. 
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1962, 1963, 1963-64, 1964; Holland & 
Nichols, 1964). If anything, the results of 
the present study are more consistent with 
the hypotheses in the theory than were the 
results of earlier studies. For example, 


consistent with one another in that all studies 
But be- 


largely because they were not possible or ap- 


for replication of 
the findings about institutional influence are 
tenuous. In addition, large numbers of col- 
leges are required to test all of the main hy- 
potheses in the theory. In the present study, 
the small sample of colleges precluded the 
test of many hypotheses and it limited con- 
fidence in the outcomes. At the same time, 
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tive efficiency of the analyses of students 
whore vocational choices belonged to the 
same or different vocational classes was s 
partial test of this idea. 

Second, the theory needs several revisions. 
A person's preference for various vocational 
roles needs to be clarified, for the assignment 
of roles to types is often contradicted by the 
evidence, Another more extensive revision of 
the theory is needed to provide more con- 
sistent results in the case of women. The use 
of the special classification for women im- 
proved the prediction of their vocational 
choices, but many other analyses are still 
significant for men but not for women 

Third, the success of the VPI as a das- 
sifying tool for assigning occupations and 
fields to classes implies the value of both 
making the classification more comprehensive 
and studying more subclasses, The extension 
of the classification scheme (Holland, 1966a) 
can be performed simply by assessing repre- 
sentative samples of occupations with the 
VPI, computing means, and assigning the cc. 
cupation to the appropriate class in terms of 
its coded mean VPI profile. 

Last, the partial success of the theory as 
a way to interpret student-institutional inter- 
actions implies that employee-organizational 
interactions and outcomes such as tenure and 
job satisfaction may follow the same prin- 
ciples. The typological and environmental 
formulations of the theory would provide a 
way to control and interpret such interactions. 
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TEAM PERFORMANCE AS A FUNCTION OF TEAM 
ARRANGEMENT AND WORK LOAD 


WILLIAM A. JOHNSTON anp GEORGE E. BRIGGS 


Ohio State University 


Pairs of Ss served as radar controllers in a simulated approach-control task, 
and they were required to alternate in directing aircraft approaches. Pre- 


journal of applied psychology 


VoL. 52, No. 2 


Aprit 1968 


scribed approach rate was 1 approach every min. (high work load) or every 
2 min. (low work load). S could compensate for his partner's early or late 
approaches in compensatory teams but not in noncompensatory teams. Each 


team completed 4 experimental sessions. 


Standardized algebraic average of 


approach times was closer to criterion in compensatory teams than in non- 
i compensatory teams, particularly under 

flight errors occurred in compensatory teams than in noncompensatory teams. 
performance only in the noncompensa- 


Team communication inhibited team 


others enhance, team output. 


N The available evidence favors the conclu- 
sion that team output is an inverse function 
of the extent to which teammates are Te- 
quired to coordinate, communicate, or other- 
wise interact with one another (Briggs & 
Naylor, 1965; Johnston, 1966; Kidd, 1961; 
Lanzetta & Roby, 1956, 1957; Naylor & 
Briggs, 1965; Williges, Johnston, & Briggs, 
1966). An important implication of this con- 
clusion is that such team functions should 
be minimal and individual functions maximal 
in the design of multiman systems. However, 
the previous investigations have neglected at 

east two potentially useful team functions: 
the fail-stop and compensatory functions. The 
former is one in which a team member pre- 
vents his partner from committing an error, 
and the latter is one in which a member cor- 
rects or counteracts his partner’s error after 
it has been committed. 


1 This research was carried out at the Human 
Performance Center and was supported in part 
by the United States Navy under Contract No. 
N61339-1327, sponsored by the United States Naval 
Training Device Center, Orlando, Florida. Repro- 
duction of this publication in whole or in part is 
permitted for any purpose of the United States 

Government. The authors are grateful to Robert 
g C. Williges for his help in analyzing the data. 
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low load. Under high load fewer 


tory high-load condition. It was concluded that some team functions hinder, 


These team functions would seem to be 
particularly valuable in systems in which 
errors can eventuate in disaster. For example, 
one can well imagine the fatal consequences 
which would ensue if fail-stop and com- 
pensatory team functions were not incorpor- 
ated ‘into surgical and trapeze teams! of 
course, the utility of these functions should 
vary with system load. For example, the 
frequency of individual errors and therefore 
the advantage of the fail-stop function should 
be directly related to the load on the system. 
On the other hand, the opportunity for com- 
pensatory activity should be an inverse func- 
tion of system load; the higher the load, 
the more pressed a team member is to com- 
plete his own task, and the less time he has 
to compensate for a teammate’s errors. The 
present study explored. the thesis that com- 
pensatory and fail-stop team functions can 
facilitate team output, at least under certain 
conditions of system load. 

In addition, the present study sought to 
further assess the inhibitory team function of 
intermember communication (Williges et al., 
1966). The effect of this team function should 
also vary with system load: the greater the 
load, the less a team can afford the luxury of 
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team communication, and so the greater 
Should be the disruptive effect of such com- 
munication. 


METHOD 


Task and apparatus. A simulated approach-con- 
trol task was used in which subjects (Ss) served 
as radar controllers (RCs) who were responsible 
for directing aircraft through an approach gate. 
The approaches were to be effected at a prescribed 
rate, and two Ss were required to alternate in 
controlling approaches, That is, the first approach 
was to be directed by RC, the second by RCs, 
and then RC, assumed responsibility for the third 
incoming aircraft, and so on. 

The radar displays were linked to target-generator 
consoles via a special-purpose analog computer 
(Hixson, Harter, Warren, & Cowan, 1954). The 
role of pilots was assumed by well-trained experi- 
mental assistants who worked at the target-generator 
consoles. The RCs delivered instructions to pilots 
Over a voice-communication channel, and the pilots 
carried out the instructions faithfully and without 
delay by appropriate manipulation of their con- 
soles. The RC-to-pilot communication protocol 
required that the RC first identify the pilot and 
then issue the command, for example, “Bravo one, 
speed 200 knots”; in return, the pilot was to im- 
mediately confirm the command, for example, 
“Roger; Bravo one, speed 200 knots.” The RCs 
gave only heading and speed commands to the 
pilots; altitude was intentionally omitted from 
consideration in order to maintain a reasonable 
level of task difficulty, 

One of the displays as it appeared at the 
beginning of a session is reproduced in Figure 1, 
The approach gate was located precisely at the 
center of the displayed airspace, and all aircraft 
entered the airspace from the eastern (right-hand) 
periphery. The display was marked with con- 
centric rings, and the distance between adjacent 
rings represented 20 mi, The distance from the 
periphery to the approach gate represented 100 mi. 
The aircraft appearing in the northeast sector of 
the scope were designated as “Alpha” aircraft, and 
those appearing in the southeast sector were referred 
to as “Bravo” aircraft. The Alpha and Bravo air- 
craft were indicated by different codes. The two 
RCs monitored the same airspace but on different 
displays, and they could speak to one another only 
over a voice-communication channel. The Alpha 
RC was assigned to the Alpha aircraft, the Bravo 
RC to the Bravo aircraft, The team-communication 
protocol required that both RCs identify them- 
selves before each conversation. All team com- 
munications were tape-recorded, A partition was 
erected between the two RCs so they could not 
see one another, and an ambient noise generator 
prohibited the RCs from hearing one another ex- 
cept by means of the inter-RC communication link, 

At the beginning of a session, four Alpha air- 
craft and four Bravo aircraft were spaced evenly 
along the eastern periphery of the airspace. An 
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Fic. 1. Reproduction of a display at the start 
of a session. 


Alpha aircraft made the first approach, a Bravo 
aircraft followed, then another Alpha aircraft, and 
so on. A successful approach occurred when an 
aircraft entered the approach gate at 200 kn. on 
a heading of 270°. A “miss” occurred if an aircraft 
in the inner circle of the display crossed the 
longitudinal axis into the western half of the 
airspace in any condition not constituting a successful 
approach. Aircraft were removed from the display 
following successful landings or misses, and as soon 
as one aircraft was removed a new one was posi- 
tioned at the eastern periphery. A safety infraction 
occurred if two aircraft came within 2 mi. of one 
another. However, a safety infraction did not lead 
to the removal of the aircraft. 

The entire operation, including the scoring and 
recording of approaches, misses, and safety infrac- 
tions, the presentation and removal of aircraft, and 
the administration of feedback, was conducted by 
a team of two experimental assistants, 

Subjects and design. The Ss were 64 male under- 
graduates who served in pairs for five consecutive 
sessions. All were paid for their services at the rate 
of $1.25 per session, Four conditions were formed 
from the factorial combination of two levels of team 
arrangement (compensatory and noncompensatory) 
with two levels of system load (low and high). 
Teams were composed of Ss who had volunteered 
for the same experimental time period, and they 
were randomly assigned to the four conditions 
(groups) with the restriction that groups be filled 
equally across the 8 wk. of data collection. 

Procedure. The first session was devoted to 
general instructions and practice, and the experi- 
mental conditions were implemented during the 
remaining four sessions. Each session required 35 
min., and the intersession interval was 24 hr. : 

The required approach rate (system criterion) 
Was an approach every minute for high load, and 
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one every 2 min. for low load. In the non- 
compensatory arrangement, the system criterion was 
to be satisfied on each approach. In the low-load 
condition, for example, a given approach was to 
be effected precisely 2 min. after the preceding air- 
craft passed the gate regardless of any time error 
that might have accrued over prior approaches. By 
contrast, a given approach in the compensatory 
arrangement was to compensate for the accumulated 
time error. Hence, if a time error of 20 sec. 
late had accrued over prior approaches, the next 
approach was to be 20 sec. early so that the 
average of the approach times would equal the 
system criterion. 

A clock system was mounted on the wall in front 
of the team in such a way that the RCs could keep 
continuous track of their temporal progress in 
guiding aircraft through the approach gate. Each 
clock kept time in terms of minutes and seconds 
up to 1 hr. and could be viewed by only a slight 
shift in an RC’s line of vision from the input dis- 
play. Only one clock was used in the compensatory 
condition, and the RC team was instructed to have 
an aircraft through the approach gate at every 
one (high load) or two (low load) complete ro- 
tations of the second hand. In order to avoid 
anomalies that arise when unusually large time 
errors accrue over previous landings, the clock was 
restarted after every six approaches. 

Two clocks were used in the noncompensatory 
condition: one for the Alpha RC, the other for the 
Bravo RC. When the Alpha RC made an approach 
(regardless of how early or late the approach might 
have been), the Bravo RC’s clock would start and 
a Bravo aircraft was to be at the approach gate 
after either one (high load) or two (low load) com- 
plete rotations of the second hand. Then, when the 
Bravo aircraft passed through the approach gate, 
the Alpha RC’s clock (which had been reset during 
the Bravo approach) would start again, and so on. 

The following aspects of the task characterized 
both team arrangements: (a) A small red light in- 
dicated whose turn it was to effect an approach; 
(b) the timing started upon the completion of the 
first approach of the session; and (c) immediate 
feedback was provided to the RC team of time 
errors (relative to the compensatory or non- 
compensatory criteria), misses, and safety infrac- 
tions immediately after each approach except the 
first. 

Performance measures. The experimental timing 
of each approach began when the immediately 
preceding aircraft passed through the approach 
gate. Approach-time error was defined as the dif- 
ference between actual approach time and desired 
average approach time (actual minus 60 sec. or 
120 sec. depending on the system criterion in effect). 
Thus, a +20-sec. time error indicated an approach 
that was 20 sec. late (eg. an approach time of 
either 80 sec. or 140 sec.). Both the algebraic and 
the absolute sums of the time errors were calculated 
for each team on each of the last four sessions. 
The algebraic or constant error (CE) score was 


then divided by the absolute or variable error 
(VE) score to provide a single measure for each 
team on each session. 

The CE/VE ratio is analogous to the 2 or stand- 
ard score; thus, it is a way to compare CEs from 
populations differing in variability. In the present 
study the CE/VE ratio has special implications for 
the compensatory versus noncompensatory com- 
parisons. In effect, the instructions to the com- 
pensatory teams emphasized small CEs (at the 
expense of VE), while the instructions to the 
noncompensatory teams, in effect, emphasized small 
VEs (at the expense of CE), Relative to the non- 
compensatory arrangement, therefore, the com- 
pensatory arrangement was expected to reduce CE 
but to elevate VE. In short, smaller ratio scores 
were expected under the compensatory condition 
than under the noncompensatory arrangement, Fur- 
ther, this difference was expected to be particularly 
pronounced under low system load because this 
condition provides minimal constraints on com- 
pensatory activities. 

Each team was scored also in terms of total 
number of flight errors (misses and safety in- 
fractions) per session. It was anticipated that an 
RC would pay more attention to his partner’s ac- 
tivity, and would therefore be more likely to pre- 
vent his partner from committing errors, in the 
compensatory arrangement than in the noncom- 
pensatory arrangement. This fail-stop function 
should be most pronounced under high load, the 
condition in which flight error frequency should 
be maximal. Hence, error frequency was expected 
to be higher in the noncompensatory arrangement 
than in the compensatory arrangement, particularly 
under high load. 

Team-communication measures. All team com- 
munications were categorized according to the con- 
tent-analysis scheme described elsewhere (Williges 
et al., 1966). This scheme is based on five major 
categories of communication: identification, irrele- 
vant, declarative, tactical, and command messages. 
Teammates should have more reason to com- 
municate with one another in the compensatory 
arrangement than in the noncompensatory arrange- 
ment, and should have more “freedom” to do so 
under low-load than under high-load conditions. 
It follows that team communication should be most 
frequent in the compensatory low-load condition 
and should be most disruptive in the noncom- 
pensatory high-load condition. 


RESULTS 


Team performance. An analysis of variance 
of CE/VE ratios produced the following 
significant effects: system load, F(1/28)= 
12.63, p < .01; team arrangement, F(1/28) 
=4.24, p< .05; sessions, F(3/84)= 6.49, 
p< .01; and the System Load X Team Ar- 
rangement X Sessions interaction, F(3/84)= 
4.56, p < .01. The character of these effects 
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Noncompensatory High 
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Absolute Error 


Algebraic Error 


Fic. 2. The ratio of algebraic error to absolute 
error for each experimental condition as a function 
of sessions. 


is depicted in Figure 2. Specifically, the CE/ 
VE ratio decreased across sessions, increased 
with load, and was higher in noncompensatory 
teams than in compensatory teams, par- 
ticularly on Session 1 under low system load. 
In accordance with prediction, therefore, the 
extent to which “standardized” algebraic error 
was reduced at the expense of absolute error 
Was greatest in the compensatory low-load 
condition. The sessions effect arose from a 
greater intersession decline in algebraic error 
than in absolute error, which finding sug- 
gests that practice benefited CE more than it 
did VE. 

The time error per approach averaged 
11.13 sec. and 18.43 sec. under low and high 
load, respectively. These data indicate that 
the RC teams were under considerable time 
stress even under “low” load. Had the low- 
load criterion been, say, one approach every 
3 or 4 min., then the effect of team arrange- 
ment on the CE/VE ratio may have been 
even more obvious. 

The following sources of variance were 
significant in the analysis of variance of 
frequency of flight errors: load, F(1/28) 
= 13.01, ~<.01; sessions, F (3/84) = 
55.81, ~<.001; Load x Sessions, F(3/ 
84)= 3.49, p< .05; and the Load X Team 
Arrangement X Sessions interaction, F (3/84) 
=3.33, p < 05. The nature of these effects is 
apparent in Figure 3. The frequency of flight 
errors increased with system load and de- 
creased across sessions. As was expected, 
flight errors were less frequent in the com- 
pensatory arrangement than in the non- 


compensatory arrangement, but only in the 
first experimental session under high load. 

Team communication. Team arrangement, 
F (1/28) = 10.86, p< .01, communication 
category, F(4/112)= 26.72, p< .001, and 
Team Arrangement & Communication Cate- 
gory, F(4/112)= 2.83, p< .05, were all 
that attained significance in the analysis of 
variance of communication frequency. Con- 
trary to expectation, team communication 
was not influenced by load. In agreement 
with expectation, however, communication 
frequency was higher in compensatory teams 
(an average of 105 per session) than in 
noncompensatory teams (47 per session). 
Table 1 shows the relative frequencies of the 
five communication categories for each team 
arrangement. The relative frequencies of 
declarative messages, tactical communica- 
tions, and commands were higher in com- 
pensatory teams than in noncompensatory 
teams, and the relative frequency of irrele- 
vant communications was higher in noncom- 
pensatory teams than in compensatory teams. 
Identifications were relatively frequent in 
both team arrangements. 

Correlational analyses were performed to 
assess the relationship between communica- 
tion frequency and team performance (all 
measures averaged across sessions). Signifi- 
cant correlations were obtained only with 
respect to the noncompensatory high-load 
condition, in which Condition CE/VE, 16) 
= 81, p<.05, CE (algebraic error), 7(6) 
= 93, p< .01, and VE (absolute error), 
7(6)= .91, p< .01, were directly related to 
communication frequency. It may be recalled 
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Fic. 3. The total frequency of flight errors for each 
experimental condition as a function of sessions. 
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TABLE 1 


MEAN RELATIVE FREQUENCY OF OCCURRENCE OF 
EACH COMMUNICATION CATEGORY FOR 
Born TEAM ARRANGEMENTS 


x Parc} Team arrangement 
Communication s 


category 
Compensatory | Noncompensatory 
Identification 28 32 
Irrelevant -06 16 
Declarative 27 .22 
Tactical .20 15 
Command 19 15 


that noncompensatory teams were instructed, 
in effect, to maximize CE/VE by reducing 
VE at the expense of CE. At first glance, it 
might appear that, under high work load, 
team communication aided noncompensatory 
teams in their effort to maximize CE/VE. 
However, team communication augmented 
CE/VE not by reducing VE and elevating 
CE, but rather by increasing both error 
indexes, the increase apparently being more 
pronounced for CE than for VE. In actuality, 
therefore, team communication disrupted the 
performance of noncompensatory teams under 
high load. That this disruptive effect of team 
communication was restricted to the non- 
compensatory high-load condition is conso- 
nant with prediction and with the fact that 
irrelevant communications were relatively 
prominent in the communication profile of 
noncompensatory teams. 


DISCUSSION 


The present data lend testimony to the 
proposition that team performance can be 
augmented by the compensatory and fail-stop 
team functions. Despite the fact that the 
absolute level of system load was quite high 
even in the low-load condition, compensatory 
teams were able to partially offset accrued 
time error. Thus, the compensatory arrange- 
ment would be of service in real approach- 
control teams when the normal flow of air 
traffic has been impeded as a result of, say, 
adverse weather. However, this value of 
compensatory activity appears to be an in- 
verse function of system load. On the other 
hand, the utility of the fail-stop function is 


apparently a direct function of system load. 
Consequently, when system load is high, a 
team member may not be able to undo his 
partner’s errors once they have been com- 
mitted, but he may be able to reduce the 
number of errors that his partner in fact does 
commit. Hence, where one team function loses 
value, the other gains. 

Turning now to the communication data, 
it is apparent that team-communication fre- 
quency was augmented by compensatory ac- 
tivity but was not affected by system load. 
However, it is again noteworthy that the 
absolute level of work load was quite high 
even in the so-called low-load condition; a 
greater disparity in the two levels of load 
might have induced a disparity in com- 
munication frequency. In regard to the role 
of team communication in teamwork, the 
present findings further substantiate the pre- 
viously found disruptive effect of team com- 
munication on team performance (Briggs & 
Naylor, 1965; Johnston, 1966; Williges et al., 
1966). Of course, since this disruptive effect 
was limited to the noncompensatory high- 
load condition, it appears that team com- 
munication hinders team performance the 
most when there is the least need and the 
least freedom to communicate. When there is 
enough time and perhaps some reason to 
communicate, team communication neither 
retards nor enhances team performance. 

In summary, certain team functions such 
as team communication are potentially detri- 
mental to team output and should be re- 
stricted in system design and by the operat- 
ing procedures. Contrariwise, other team func- 
tions such as compensation and error pre- 
vention are potentially advantageous and 
should be incorporated into the design of 
those systems for which the unique features 
and effects of these team functions are 
appropriate. 
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LEARNING OF TELEPHONE NUMBERS WITH AND 
WITHOUT LETTER-EXCHANGE PREFIXES 


were 
numbers with letter exchanges or with 
terms of mean sores fina! 


Yumber-exchange systems in recent years 
have substantially replaced 
systems in telephone numbers. 
dividuals and even protest organizations, for 
example, the Anti-Digit Dialing League, have 
expressed opposition to telephone company 
policy in that regard. In part, their opposition 
das been based on the alleged increased diffi- 
culty in learning an all-digit code as opposed 
to learning a letter-digit code. 
esearch literature did not 


yield reports of experiments in the area of 


quests for such research reports of Bell Tele- 
phone Laboratories also yielded no experi- 
mental reports nor citations of any reports. 
The purpose of the study now reported 
concerning 


In a group situation, 36 college students, randomly 
tested for recall on S- or de en 


and 
of telephone numbers with lettered exchanges 
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tions 
to obtain evidence the relative Because there was the possibility 
efficiency of learning lists of telephone num- 2 
bers having alphabetical or letter excluded. 
two letters and five digits—as contrasted with Materials were presented to Ss in the form 
; which they were found in telephone directory. 
learning telephone numbers composed of An equivalent all-digit list was t 5 hon on 
digits exclusively—seven digits. letter-digit list. The learning materials are indicated 
METHOD 
Letter-Exchange List AN-Digit List 

Subjects. The Ss were 36 college students, most of 
them sophomores. They were assigned at random TR 49183 574-9153 
into four groups, nine Ss in each group. KE 6-1748 $36-1748 

Procedure. Group A practiced a 10-unit list; each WH 3-1526 943-1826 
unit was composed of 2 letters and 8 digits. This is MU $-9214 688-921 
the letter-exchange list. Group B practiced a 10-unit VA 6-139 826-1394 
list; each unit was com of 7 digits. This is the PL 2-685 752-6845 
all-digit list. Group C practiced an S-unit list, each FR 6-2419 376-2419 
unit composed of 2 letters 5 digits. These were SA 1-8935 721-8935 
the first 8 items in the list used by Group A Ss. MI 7 Sras 
Group D practiced an g-unit list, each unit com- HU 3-2 483-2965 
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The Ss performed under group conditions. Each 
S had a copy of a duplicated list.t They worked 
at long tables and had abundant room for materials 
and for writing. The Ss were given 20 practice 
periods each of which was immediately followed 
by a recall period. During the recall period, re- 
sponses were written on response forms coded from 
1 to 20; on completion of the recall period, response 
forms for each S were deposited by each S into 
S-coded envelopes. Each practice period was 30 sec. 
long and each recall period was 60 sec. long. In- 
tervals between the practice - recall sequences were 
about 10 sec. in duration. 


RESULTS 


All groups and individual Ss made progress 
in the direction of learning. Groups A, B, C, 
and D obtained mean scores of 4.8, 3.5, 4.8, 
and 3.3, respectively. In each case, the groups 
practicing telephone numbers with letter ex- 
changes performed better than those having 
lists of the same length with appropriate 
numbers substituted for the exchange letters. 

An analysis of learning progress by group- 
ing the data in terms of deciles, quintiles, and 
quartiles disclosed that at each point in the 
learning sequence the letter-digit group is 
superior to its parallel all- digit group. Per- 
formance of Groups A, B, C, and D on the 
twentieth, that is, the final, trial were 22 
6.0, 6.9, and 5.9, respectively. 

Individual differences in performance were 
obvious. In each group, with regard to total 
performance on the 20 trials, the score of the 
best performer was more than double that of 
the poorest performer. 


Discussion 


Despite “loading against” letter-digit com- 
binations (by excluding two-letter word ex- 
changes which would ordinarily occur some- 
times by chance), results indicate that from 
the standpoint of facility in learning telephone 
numbers, the traditional system with a letter 
exchange is more efficient than the “exchange- 
less” or all-digit system. With a 10-unit list, 
on the average, about 37% more material 
is learned of the conventional letter-exchange 
phone numbers over comparable all-digit 


The paired-associates technique was considered 
as an alternative method, but several problems 
concerned with that method and the primary objec- 
tive of this study suggested that the serial method 
was more desirable, 
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phone numbers. With an 8-unit list, on the 
average, about 459% more material is learned 
of the conventional letter-exchange phone 
numbers over comparable all-digit phone 
numbers. 

Two related viewpoints may be pertinent 
in attempting to explain the major finding of 
this research, namely, the superiority of 
letter-exchange learning. These are termed the 
isolation viewpoint and the interference view- 
point. 

1. The isolation viewpoint maintains that 
a small group of symbols of one kind among 
a more numerous group of symbols of another 
kind are apprehended and learned more 
readily, perhaps as a function of contrast. 
In any case, this is related to Von Restorff’s 
(1933) finding that if a syllable is inserted in 
a list of numbers or a number in a series of 
syllables the exceptional or “isolated” item 
will be better recalled. 

2. The interference viewpoint maintains 
that because of the similarity or relative 
homogeneity of items in the all-digit list, 
proactive and retroactive influences within the 
list may have interfering effects on recall. 
A variety of researches on proactive and 
retroactive effects provide research support 
for such a position; much of the literature in 
this area has been summarized by McGeoch 
and Irion (1952, pp. 404-447) and by Under- 
wood (1957). 

There is also a third possibility which is 
that letters are easier to learn than numbers; 
this may be a function of greater frequency of 
contact (experience) with letter material than 
number material under ordinary circum- 
stances. 

Assuming that the results are meaningful 
and generalizable, for the convenience of their 
subscribers, telephone companies might con- 
sider a return to the earlier more conventional 
letter-exchange system involving, if not a 
name exchange, at least a pair of letters. 
The use of letters as opposed to numbers 
provides for a distinct means of differentiat- 
ing, designating, and identifying—perhaps, in 
most cases, with less opportunity for retro- 
active and proactive inhibition. Further, one 
might have several different letter designa- 
tions for the same set of numbers. For ex- 
ample, a 74 could be PI, RI, or SI as well as 
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PH, RH, and SH. This would achieve better 
differentiation than using 74 cach time. 
749-1752 and 748-6945 might be better 
learned as SI 9-1752 and PH 8-4645. This, 
of course, assumes that in the lists to be 
learned, other combinations than 74 are most 
numerous. 

The letter exchange need not necessarily 
be the first two letters of a meaningful word 
as has usually been the case. For example, 
the 83 “exchange” might be dialed if the 
prefixes were VD, TD, or UF, none of which 
are words or word prefixes in the English 
language; but all are distinct from each other 
and all are commonly used abbreviations 
and, therefore, not 
meaning. 

Three-letter combinations of words or of 
nonsense syllables are also not beyond the 
realm of possibility. 
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Spurenfeld. Psychologische Forschung, 1933, 
18, 299-342. 
(Early publication received July 18, 1967) 
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PREDICTION OF SALES SUCCESS FROM FACTORIALLY 
DETERMINED DIMENSIONS OF PERSONAL 
BACKGROUND DATA 


MELANY E. BAEHR ano GLENN B. WILLIAMS 


Industrial Relations Center, University of Chicago 


A concurrent validity study was made of the scores of 210 salesmen and 
16 district managers on 15 personal background dimensions, identified previ- 
ously by factor analysis. Differences between factor-score means for the 
manager and sales groups and for the upper- and lower-rated sales groups 
were significant for the factors Financial Responsibility (.001, 001), Early 
Family Responsibility (.01, .01), and Stability (.001, 05). Multiple-regression 
analyses of the factor scores against each of 5 criterion measures of on-the-job 
behavior yielded correlations of 42, .50, and .36 for the 3 major criteria of 
overall performance, and mean and maximum sales volume rank. The criteria 
route difficulty and tenure gave correlations of .27 and .30. Interpretation of 
the highest-weighted factors in these analyses indicates the logical, dynamic 
relationships between personal background and job behavior. 


The bulk of research on the prediction of 
occupational success from quantitative per- 
sonal-background-data items has taken an 
essentially empirical approach (Buel, 1965; 
Dunnette & Maetzold, 1955; Lockwood & 
Parsons, 1960; Minor, 1958; Mosel & Wade, 
1951; Scollay, 1956; Smith, Albright, & 
Glennon, 1961). It first determines the dis- 
criminative power of individual items in 
measuring a given criterion of success in 
some occupation. Then it combines those 
items which discriminate (with or without 
weighting) into an instrument which yields 
an index or “score.” Such scores have been 
individually cross-validated for a variety of 
occupations in industry and the armed ser- 
vices. They have also contributed materially 
to multiple correlations of test batteries with 
empirical criteria of performance. However, 
the approach has been criticized for its basic 
empiricism (Dunnette, 1962; Morrison, 
Owens, Glennon, & Albright, 1962) and its 
failure to shed any light on the dynamic rela- 
tionships between personal background on the 
one hand and occupational choice and suc- 
cess on the other. 

The present investigation is a sequel to a 
study designed to identify and define some 
of the underlying dimensions of personal 
background data (Baehr & Williams, 1967). 
The dimensions were identified through suc- 
cessive factor analyses of responses to a wide 
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variety of background items by an occupa- 
tionally heterogeneous sample of 680 male 
employees. The final factor analysis yielded 
15 interpretable oblique first-order factors. 
A factoring of the correlations among these 
primaries produced five uncorrelated second- 
order factors which appeared to represent 
broad behavior patterns associated with the 
needs and achievement drives of individuals. 

Two tests were run on the efficacy of the 
background dimensions, as measured by the 
first-order factor scores, in differentiating 
among the 10 occupational groups covered by 
the sample. An analysis of variance yielded 
F ratios which were virtually all significant 
beyond the .05 level of confidence, the ma- 
jority of them beyond the .001 level. A 
complete series of f tests revealed that each 
pair of occupational groups showed differ- 
ences which were significant at the .05 level 
or better and, conversely, that each factor 
discriminated significantly among some of the 
pairs. 

These findings were considered an adequate 
basis for concluding that the dimensions of 
the personal background data were related 
to occupational classification. Further, as 
suming that an individual’s occupational clas- 
sification is to some extent under his own 
control, then one can assume that the back- 
ground dimensions are also related to oc- 
cupational choice. Moreover, since the ma- 


jority of the cases in the sample had been 
in their given occupation for some y 

thus must be regarded as at least moderately 
successful in it—the final assumption can be 
made that the background dimensions are 
related to occupational success. In the present 
study, this final hypothesis was experimentally 
investigated. The Ss were the 210-man sales 
force and its 16 district managers of a 
specialty-food manufacturing organization 
with branches at various locations through- 
out the United States. 


METHOD 
Variables 


The predictor variables were the unit-weight 
factor scores derived from the 15 first-order factors. 
These factors are defined as follows: 


Factor 1 (School Achievement). Academic achieve- 
ment, particularly in high school, but also, where 
applicable, at college. A general liking for, and 
adjustment to, the school environment. 

Factor 2 (Higher Educational Achievement). 
Special or technical accomplishment and qualifica- 
tion resulting in a relatively late vocational start 
and late assumption of family responsibilities This 
pattern is characterized by eventual occupational 
and financial achievement. 

Factor 3 (Drive). Inner drive to be outstanding 
in performance, to attain high goals even if this 
entails temporary setbacks, to supervise others, and 
to achieve success and advancement. 

Factor 4 (Leadership and Group participation). 
A desire to establish contact with others as shown 
by membership and interpersonal activity in orga- 
nizations and i 
through community and social activities. 

Factor 5 (Financial Responsibility). Ability to 
manage a personal economy of defined proportions— 
to earn, invest, save, and accumulate. 

Factor 6 (Early Family Responsibility). Early 


husband ordinarily being the sole provider. Demon- 
strated achievement in handling the family’s financial 
affairs. Outside the work situation, 
interest is in family activities. 
Factor 7 (Parental Family Adjust ment). Develop- 
ment of realistic, constructive attitudes and rela- 
tionships in the early family environment. In- 
cluded are relationships among siblings, between 
parents, and between the child and the parents. 
Factor 8 (Stability). Established security and 
stability in the work resulting from @ 
past history of good performance, The present con- 
cern is more to maintain what has been achieved 
than to plan for improvement or development. 
Factor (School Activities). Major emphasis on 
active participation in athletic and extracurricular 
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sodal activities at bigh school but abo indications 
of good academic 


parental background characterized by 4 
father, eitber self-employed or in one of the pro- 
fessions, and by material comfort and a bappy 
home life. 

Factor 11 (Educational-V ocational Consistency). 


Factor 12 (Vocational Decisiveness). Decisiveness 
and purposefulness in 
achieving the necessary qualifications, followed by 
an early start in the chosen occupation. 

Factor 13 (Vocational 
with occupational choice and the expectation that 
peak performance will be some time in the future. 
There is, however, no of consistency of 
application or of drive to achieve high standards 
of performance. 

Factor 14 (Selling Experience), Various kinds of 
including door-to-door selling 
and transactions in real estate. 

Factor 15 (General Health). Generally better than 
average health over an extended period (childhood, 
adolescence, adulthood). General freedom from 
physical ailments and from loss of work time 
resulting from illness. 


The criterion indexes were five measures of on- 
the-job performance- three major ones of these 
were directly related to success in selling. 


manager, and either the office manager or the 


posite performance measures used in this investiga- 
tion. 

Mean sales volume rank For the past 10 yr. 
n has ranked each member of its 
sales force on the basis of his volume of sales for 
the year. The criterion measure used here is the 
average of all ranks assigned to a salesman over 
the period in which his sales volume was ranked. 

Maximum sales volume rank. (See Footnote 1.) 
This figure is the highest ranking the man recel 
over the last 10 yr. or over his period of tenure if 
this were less than 10 yr. 

Route difficulty. Each branch manager used the 
paired-comparison technique to rate the sales routes 
in his region from most to least difficult in the 


1Jn the calculations presented in this paper, the 
been reflected. Thus, a high rank in- 
volume. The commission re- 
ceived by salesmen is based directly on sales volume. 
For this reason, sales volume was selected as a 
performance criterion rather than pay, where the 
performance aspect is obscured by higher base pay 
for long-tenure salesmen. 
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TABLE 1 
CORRELATIONS BETWEEN CRITERION MEASURES 


Peer Performance Mean sales Maximum Route 
Criterion measure rating zer PEENE difficulty | Tenure 
Performance rating 37% 24% —.20** —.13 
Mean sales rank 38 Jene .02 18 
Maximum sales rank aa NY cg 02 .16* 
Route difficulty —.20** .02 02 —.02 
Tenure —.13 -18** .16* —.02 
*p <.05. 
* 2.01. 
* < 001. 
realization of sales potential. These ratings were lated separately for each of the five criterion 
converted to normalized standard scores. measures. 
Tenure as a salesman. This measure was the num- 
ber of years the individual had been employed in RESULTS 


the organization as a salesman. 
The correlations between these criterion measures 
are shown in Table 1. 


Subjects 


The sales personnel who were Ss in this study 
were not part of the sample used in identifying 
the personal background dimensions. The sales 
groups in the original 680-man sample were paper- 
forms salesmen and district sales managers in the 
insurance business. The 210 salesmen in the present 
investigation were employed in a food-products 
manufacturing organization supplying institutions 
and retail outlets. Their average education was 
1-2 yr. of college and their average age 42 yr. 

The 16 district managers, representing the first 
line of sales management in the organization, had 
about the same educational background as the 
salesmen. Their average age was 48 yr. Since the 
organization has a strong policy of promotion from 
within and since the district manager’s job still in- 
cludes a fair amount of personal selling, it may be 
assumed that the district managers represent the 
upper end of the performance continuum for the 
sales force. 


Procedure 


The Ss’ factor scores were secured by administer- 
ing a multiple-choice questionnaire called the Per- 
sonal History Index (Baehr, Burns, & McMurry, 
1965). 

T-test analyses. A t-test analysis was made of the 
significance of the differences between the mean 
personal-history factor scores of the group of dis- 
trict managers and of the sample sales group. An- 
other such analysis was run on the differences be- 
tween the mean factor scores of those salesmen 
with ratings of 55 and above on the standard-score 
scale for the paired-comparison assessments and 
those with ratings of 45 and below. 

Multiple-regression analyses. A multiple correla- 
tion for the personal background factors was calcu- 


Results of the ¢-test analyses are shown in 
Tables 2 and 3. The three factors of Financial 
Responsibility, Early Family Responsibility, 
and Stability differentiated at statistically sig- 
nificant levels both between the salesman and 
district manager groups and between the 
upper- and lower-rated sales groups. The 
scores on these factors rise successively from 
the lower-rated salesmen through the upper- 
rated salesmen to the district managers. These 
results correspond with the assumed level of 
sales performance of each of these groups. It 
is interesting to note that the three differ- 
entiating factors are among the first 8 of the 
15—those which had previously been identi- 
fied as having the most potential for opera- 
tional use (Baehr & Williams, 1967). 

Results of the multiple-regression analyses 
are given in Table 4. The computer programs 
for these analyses were written in such a way 
that a variable would be included only if its 
degree of variance produced an F ratio at the 
05 level of confidence or better. The ob- 
jectives of this programming were to prevent 
undue “forcing” of variables into the regres- 
sion equations and to lessen the effects of 
purely chance relationships of the variable 
and the criterion. 

The three major criterion measures of 
success are the paired-comparison performance 
rating, the mean sales volume rank, and the 
maximum sales volume rank. Table 1 shows 
that the latter two measures (because of their 
common source data) are highly correlated 
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TABLE 2 


SIGNIFICANCE OF THE DIFFERENCES BETWEEN MEAN Personat-History FACTOR Scores FOR THE 
District SALES MANAGER AND THE SALESMAN GROUPS 


— — — — 


District sales managers Salesmen 

Personal-history factor WW = 16) w= atm t 
Z M sD wu | 

SSS — —— 
School Achievement 244 1.37 2.19 1.56 62 
Higher Educational Achievement 5.37 1.32 4.58 1.79 1.75 
Drive 4.75 2.22 4.23 | 2.00 Kad 
Leadership and Group Participation 2.81 1.67 4.78 1667 10 
Financial Responsibility 6.50 1.06 4.86 1.80 3.87 
Early Family Responsibility 9.50 1.54 7.95 243 2.0" 
Parental Family Adjustment 7.00 1.73 6.62 2.23 6 
Stability 7.31 77 5.56 2.11 3.29% 
School Activities 3.31 2.02 3.28 1.87 20 
Professional Successful Parents 4.19 1.38 3.73 1,58 1.13 
Educational-Vocational Consistency 1.69 .98 1.33 97 1.40 
Vocational Decisiveness 2.75 1.15 2.17 1.17 1.91 
Vocational Satisfaction 6.12 1.00 5.80 1.80 .70 
Selling Experience 6.56 1.54 6.23 1.48 85 
General Health 3.19 1.55 3.4 1,08 80 
9 S700. 


(r=.75). The performance rating shows The multiple correlation for the perform- 
moderate correlations (aT and .24) with ance-rating criterion was 42 and for the 
the mean and the maximum sales volume mean and maximum sales volume ranks, .50 
and .36, respectively. In the regression anal- 


ranks. 


THE DIFFERENCES BETWEEN MEAN PERSONAL-HISTORY FACTOR SCORES FOR THE UprER-RATED 


School Achievement y 1 i i g 
Higher Educational Achievement 0 4 X i f 
Drive 5 l d k 
Leadership and Group Participation f 5 R i 148 
Financial Responsibility s 1 > È 42.5 
Early Family Responsibility 8 d ` F 3.17 
Parental Family Adjustment f $ i A .80 
Stability . ; J : 2.46* 
School Activities . 1 i ci 73 
Professional Successful Parents 5 1. j 8 73 
Educational-Vocational Consistency . : : 95 
Vocational Decisiveness 5 ` z 4 zA 
Vocational Satisfaction i ‘ . 1250 
Selling Experience ö . 5 : ot 
General Health . „ „ : : 
*p <.05. 
* < 01. 
weep < 001. 
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TABLE 4 
MULTIPLE-REGRESSION ANALYSES 
Accepted personal- Partial] Simple Beta 
history factor r f weight 


Criterion: Paired-comparison performance rating 
(multiple R = .42) 


5. Financial Responsibility 33 cae! 34 
4. Leadership and Group 
Participation 14 14 14 
13. Vocational Satisfaction 14 .06 14 
7. Parental Family Adjust- 
ment —.12 —.08 | —.11 
6. Early Family Responsi- 
bility 1 


1. School Achievement —.10 | —.05 | —.10 


Criterion: Mean sales volume rank (multiple R = .50) 


5. Financial Responsibility 31 43 33 
8. Stability 21 30 .22 
15. General Health —.12 | —.05 | —.10 
2. Higher Educational 
Achievement 12 12 10 
1. School Achievement —.10 | —.08 | —.09 
Criterion: Maximum sales volume rank 
(multiple R = .36) 
5, Financial Responsibility 20) 22 
8. Stability 12 27 13 
7. Parental Family Adjust- 
ment —.11 | —.11 —.10 
6. Early Family Responsi- 
bility 10 17 10 
Criterion: Route difficulty (multiple R = .27) 
3. Drive 19 16 19 
11. Educational-Vocational 
Consistency 17 12 217 
6. Early Family Responsi- 
bility —.12 —.09 | —.11 
15. General Health —.11 —.11 | —.11 
Criterion: Tenure as a salesman (multiple R = .30) 
13. Vocational Satisfaction —.20 | —.18 | —.20 
6. Early Family Responsi- 
bility —.19 | —.18 —.18 
12. Vocational Decisiveness 16 12 16 


yses on all three criterion measures, the 
factor Financial Responsibility carried the 
greatest weight. For both the sales volume 
criterion measures, the next greatest weight 
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was on the factor Stability. Interestingly 
enough, however, the second highest weighted 
factor for the performance: review criterion, 
which involved the judgment of the men's 
superiors, was Leadership and Group Par- 
ticipation. The factors Parental Family 
Adjustment (negative), Early Family Re- 
sponsibility, and School Achievement each 
had weights in two of the three analyses. 
Again, it should be noted that all factors 
mentioned are among the first eight—those 
recommended for operational use. 

The two remaining criterion measures— 
route difficulty and tenure as a salesman— 
yielded multiple correlations of .27 and 30, 
respectively. These coefficients are admittedly 
modest. However, they do indicate the flexi- 
bility of the background dimensions in pre- 
dicting a number of different criteria, since 
the correlations of the two minor criteria with 
the three major ones are either zero or in- 
significantly negative. 

In the route-difficulty analyses, the factor 
Drive carried the greatest weight. This is 
logical, since a high score on Drive, as Drive 
is defined here, would be associated with a 
willingness to take on the difficult and chal- 
lenging sales routes. Enquiries to the organ- 
ization did indeed elicit the fact that the 
older, less successful salesmen were generally 
to be found on the easier routes, which de- 
manded mainly a servicing of established 
outlets rather than any real sales activity in 
Opening new accounts, 

The highest weight in the tenure analysis 
is a negative one on the factor Vocational 
Satisfaction. This apparently startling result 
can be explained on the basis of the factor 
definition and the particular sales force in- 
volved. In this rather old and established 
sales group (average tenure is over 10 yr.), 
the salesmen of longest standing are gen- 
erally the men on the less difficult, servicing 
routes who no longer strive for promotion or 
for self-improvement through relocation. As 
a result, they are probably resigned to, rather 
than satisfied with, their choice of occupation 
and have no reason to feel that their period 
of peak performance is some time in the 
future. 

Results of this nature emphasize the need 
for care in interpreting personal-history data 
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for purposes of predicting on-the-job behavior. 
The “blind” application of an “index” derived 
from the items in this factor would, it is true, 
probably result in a sales force with low 
turnover. However, that sales force would 
tend to consist of older indivduals who would 
feel they were past their peak performance, 
whose aspiration levels were low, and who 
were more interested in simply maintaining 
their present position than in advancing 
in the organization. 


Discussion 


On the basis of these results, the picture 
that emerges of the successful salesman or 
sales manager is one of a man with a back- 
ground of competent handling of his personal 
economy, an early vocational start with prime 
or sole responsibility for managing family 
finances, and, particularly for the managers, 
a past history of sales achievement and pres- 
ent stability in the work and family situa- 
tions. 

An interesting finding in the analysis of 
the criterion measures of on-the-job behavior 
was that the greatest weight for tenure was 
a negative one on the factor Vocational 
Satisfaction. This latter finding leads to the 
conclusion that it would be inexpedient to 
predict tenure on the grounds of vocational 
dissatisfaction unless the organization is pre- 
pared to maintain a stable, low-turnover 
sales force at the expense of having it com- 
posed of men who feel they cannot improve 
their performance either in other positions 
or in other companies. Further research 
should make it possible to define and measure 
objectively the different background di- 
mensions which characterize successful per- 
formers in a variety of occupations. 

A general aim of this study was to stress 
the need for a rational as well as an empirical 
approach to the use of personal background 
data. An attempt was made, therefore, to 
interpret the significance of the personal back- 
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ground factors in the life of the individual 
and to show their relationships to successful 
performance in different occupational settings. 
By contrast, the more frequently used em- 
pirical approach will yield, for each occupa- 
tional group studied, a frequently highly 
discriminative but uninterpretable composite 
score or index. 
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GOALS AND INTENTIONS AS MEDIATORS OF THE EFFECTS 


OF MONETARY INCENTIVES ON BEHAVIOR * 
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Based on the assumption that goals and intentions are the most immediate 
determinants of an individual’s behavior, it was hypothesized that monetary 
incentives would affect task performance only through or by means of their 
effects on the individual's goals or intentions. 5 experiments were performed 
to explore this hypothesis. 2 dealt with the relationship of performance goals 
to level of performance (output) on a task as a function of incentive condi- 
tion. 3 experiments examined the relationship of behavioral intentions to task 
choice as a function of incentive. In all 5 studies, significant relationships were 
obtained between performance goals or behavioral intentions and behavior. 
However, when goal or intention level was controlled, there was no effect of 
monetary incentive on behavior. In the 3 choice studies where incentives did 
have an initial effect on choice, these choice differences were accompanied by 
equivalent differences in intentions. When the latter were partialed out, the 
original incentive effect was vitiated. The data were interpreted as supporting 


the hypothesis. 


A recent review of the literature on the 
motivational effects of financial compensation 
(Opsahl & Dunnette, 1966) stressed the need 
for more basic research on the mechanisms 
by which money motivates action. The authors 
stated that: 


. there is probably less solid research in this 
area than in any other field related to worker per- 
formance. We know amazingly little about how 
money either interacts with other factors or how 
it acts individually to affect job behavior 
Speculation, accompanied by compensation fads 
and fashions, abounds; research studies designed 
to answer fundamental questions about the role 
of money in human motivation are all too rare 
[p. 94]. 


The present studies were designed to help 
remedy this situation. The authors’ approach 
was based on Ryan’s (1958) theory of moti- 
vation? which states that: “a very large pro- 


1This research was supported by contract Nonr 
4792(00) between the Office of Naval Research and 
the American Institutes for Research. The views 
expressed are not necessarily those of the Navy 
Department. The authors would like to thank 
Charles D. Spielberger of Florida State University 
for his helpful criticisms of an earlier draft of 
this manuscript. 

2 Primary affiliation now with the University of 
Maryland. 

For a more recent statement of Ryan's theory, 
see also unpublished mimeos, 1964. Chapter I: Ex- 


portion of behavior is initiated by tasks [goals, 
intentions] and a very large proportion of tasks 
lead to the behavior specified by the tasks 
A task is a necessary condition for most kinds 
of behavior [p. 79].” If tasks, goals, or in- 
tentions are a necessary condition for action, 
then incentives such as money should affect 
action only if and to the extent that they 
affect the individual’s goals and intentions. 
This was the major hypothesis tested in the 
studies to be reported here. 

While the above represents little more than 
refined common sense, it should be recog- 
nized that the use of conscious goals and 
intentions as explanatory concepts in psy- 
chology is virtually nonexistent today. It is 
much more common to view incentives (“re- 
inforcements”) as shaping behavior auto- 
matically (without the operation of conscious 
volition). The present studies provided an 
opportunity to test the validity of the latter 
assumption. 

Goals can be differentiated both qualita- 
tively (e.g., “do your best” versus “try to 


plaining behavior; Chapter II: Explanatory con- 
cepts; Chapter V: Experiments on intention, task, 
and set; Chapter VI: Intentional learning; Chapter 
VII: Unintentional learning. Cornell University, 
Department of Psychology. 
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improve”) and quantitatively (e.g., “try for 
a score of 50” versus “try for a score of 
70”). In the latter case, goal level would be 
synonymous with “level of aspiration.” Two 
of the present investigators have reported a 
number of studies showing strong relation- 
ships between S’s goals and level of perform- 
ance on laboratory tasks. These results in- 
dicate that hard goals produce a higher per- 
formance level (greater output) than easy 
goals (Bryan & Locke, 1967b; Locke, 1966a, 
1966b; Locke & Bryan, 1966b, 1967a, 1967b) ; 
and that specific (quantitative) hard goals 
lead to a higher performance level than a goal 
of “do your best” (Bryan & Locke, 1967a; 
Locke, 1967; Locke & Bryan, 1966a, 1966b, 
1967a, 1967b). A number of studies by 
other investigators also support the assump- 
tion that goals and intentions are important 
regulators of task performance (DeNike, 
1965; Dey & Kaur, 1965; Dulany, 1962; 
Mace, 1935; Stedry, 1962). 

Several recent studies of nonmonetary in- 
centives have suggested that such incentives 
affect choice behavior only through their 
effects on the individual’s conscious goals and 
intentions. For instance, Spielberger, Berger, 
and Howard (1963), Spielberger, Bernstein, 
and Ratliff (1966), and Spielberger, Levin, 
and Shepard (1962) found that the effect 
of verbal reinforcements on verbal behavior 
was a direct function of S’s awareness of the 
concept to be learned and the desire of S to 
get the reinforcement. Dulany (1962) and 
Holmes (1966) found that the individual’s 
verbal responses on such tasks were a direct 
function of his behavioral intentions. 

Locke and Bryan (1966b), using post- 
experimental goal questionnaires, found that 
knowledge of score had no effect on perform- 
ance but that goals were significantly related 
to output. Locke (1967) noted that previous 
studies in this area often confounded goal- 
setting and knowledge effects. The Ss given 
knowledge of score were typically given a spe- 
cific hard goal to aim for, while Ss not given 
knowledge of score were typically told to do 
their best. When these two factors were sepa- 
rated by independent experimental manipula- 
tion of goals and knowledge of score, Locke 
(1967) found that specific hard goals pro- 
duced a higher level of performance than a 
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goal of “do your best,” but that there was 
no effect of knowledge of score per se on 
performance. 

Goal setting as a mediator of the effects of 
time limits on work speed was investigated 
by Bryan and Locke (1967b). It was found 
that time taken to complete a task expanded 
to fill the time available (in line with Parkin- 
son’s Law) or contracted to get done in the 
time allowed, but only to the extent that Ss 
set different goals under the different time 
limits. 

Finally, Meyer, Kay, and French (1965) 
found that participation in performance ap- 
praisal interviews was relatively unlikely to 
lead to improved performance unless specific 
improvement goals were set during the in- 
terviews. The authors write: 


While subordinate participation in the goal- 
setting process had some effect on improved per- 
formance, a much more powerful influence was 
whether goals were set at all.. 

The average percent accomplishment estimate for 
those performance items that did get translated into 
goals was 65, while the percent estimate for those 
items that did not get translated into goals was 
about 27 [pp. 126-127]! 


To the authors’ knowledge, no studies have 
been made of goals and intentions as medi- 
ators of the effects of monetary incentives, 
although preliminary findings by Spielberger, 
Ratliff, and Bernstein (1966) suggested that 
the desire of Ss to earn money on a verbal 
learning task played a role in determining its 
effects on performance. Whyte (1955) has 
shown in real-life settings that the effects of 
monetary incentives on output are by no 
means automatic. A series of behavioristically 
oriented experimental studies of monetary 
incentives by Toppen (1965a, 1965b, 1965c, 
1966) found the following relationships be- 
tween monetary reinforcement schedule and 
performance output on a lever-pulling task: 
(a) larger reward magnitudes and higher re- 
inforcement frequencies led to higher output, 
(b) piece-rate payment yielded a higher out- 
put than time-rate payment, and (c) decreas- 
ing the magnitude of the reward over time 
resulted in performance decrement. Although 
Toppen asserts repeatedly in these studies 
that humans respond “just like animals” to 
reinforcement, his own findings and experi- 
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mental procedures consistently contradict this 
claim. Toppen repeatedly refers to the effects 
on performance of the Ss’ “feelings of obliga- 
tion” to E, their “expectations” and “in- 
terests,” their “ulterior motives,” and their 
“desire to do well,” and he uses elaborate 
instructions and procedures to try to control 
and eliminate such effects. His very efforts in 
this respect reflect the importance and 
ubiquity of conscious factors in human per- 
formance. 

What Toppen ignored was the fact that be- 
tween the offering of a monetary incentive 
and an individual’s overt action, a complex 
sequence of conscious and subconscious 
processes occur. The possibility that these 
processes might determine kow an individual 
will react to the incentive was evidently not 
considered. For instance, before an external 
object (which does not physically touch S) 
can affect an individual’s actions he must 
(ordinarily): perceive and identify it, that is, 
be aware that it exists and of what it is; fol- 
lowing perception and identification comes an 
appraisal of the object against some standard 
of value, that is, an estimate of the extent to 
which the object will be beneficial or harmful 
to him. As a result of this appraisal (which 
may be entirely subconscious), he may experi- 
ence an emotion: “the psychosomatic form in 
which man experiences his estimate of the 
beneficial or harmful relationship of some 
aspect of reality to himself |Branden, 1966, 
P. 5].” On the basis of his emotional reaction, 
his conscious values, and his perception of the 
task and situation, the individual will usually 
(explicitly or implicitly) set himself a goal 
or end toward which he will direct his be- 
havior and in terms of which its appropriate- 
ness will be judged. He will develop an in- 
tention to act in a certain way. 

It is assumed that the individual’s specific 
goals and intentions are the end result or 
culmination of the complex intervening mental 
processes described above and the most direct 
determinants of actual performance. The pres- 
ent studies were not concerned with the 
question of why particular goals or intentions 
were developed by particular individuals, but 
only with the relationship between such goals 
or intentions, once established, and perform- 
ance. To use Ryan’s (1958) terminology, the 
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authors were concerned with specific or im- 
mediate level explanation. 

To demonstrate that goals and intentions 
were direct determinants of performance and 
mediators of the effects of incentives on be- 
havior, it would have to be shown: (a) that 
goals and intentions were related to behavior 
regardless of incentive condition, but (b) 
that the reverse was not true—that incentives 
did not affect behavior unless they affected 
goals and intentions. The above requirements 
can be expressed more precisely in terms of 
three specific subhypotheses which were the 
ones tested in the present studies: 

Hypothesis 1. Goals and intentions will be 
related to behavior regardless of incentive 
condition; that is, goals and intentions will 
be related to behavior, both within and 
across different incentive conditions. 

Hypothesis 2. When incentive differences 
do correlate with behavior differences, these 
differences will be accompanied by corre- 
sponding differences in goals or intentions. 

Hypothesis 3. When goal or intention dif- 
ferences are controlled or partialed out, there 
will be no relationship between incentive 
condition and behavior. 

Hypothesis 1 is simply a restatement of the 
general assumption that goals and intentions 
are direct determinants of behavior. If this 
assumption is valid, then the relationship 
should hold up regardless of incentive condi- 
tion. Hypotheses 2 and 3 are directly con- 
cerned with the role of goals and intentions 
as mediators of the effects of incentives. If 
incentives do affect behavior, they should 
affect goals and intentions at the same time 
(Hypothesis 2); if goal or intention differ- 
ences are controlled, incentives should have 
no effect (Hypothesis 3). 

The above three hypotheses were tested 
with respect to two different dimensions of 
behavior: level of performance or quantity 
of output, and direction of behavior or choice. 
Two experiments were performed in which all 
Ss worked on the same tasks under different 
incentive conditions, and level of performance 
(productivity) was the dependent variable. 
The relevant motivational concept in this sit- 
uation is the individual’s performance goal or 
aspiration level. If monetary incentives affect 
performance automatically then different 
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amounts of incentive should produce different 
levels of performance regardless of the level 
of the Ss’ goals (e.g., the higher the incentive, 
the higher the performance level). On the 
other hand, if conscious goals mediate the 
effects of incentives, then incentive differences 
should only produce performance: level differ- 
ences if and to the extent that they produce 
goal-level differences. 

Three experiments were performed in which 
Ss had a choice as to the difficulty of the 
tasks they would work on, and choice diffi- 
culty was the dependent variable. Incentives 
were offered for solving a task (problem) 
successfully regardless of the difficulty of the 
task. This meant that Ss were “reinforced” 
with money more frequently for choosing 
easy tasks than for choosing hard tasks. 

The relevant motivational concept in a choice 
situation is the individual’s behavioral in- 
tention (Dulany, 1962), the choice or re- 
sponse that the individual intends to make. 
If monetary incentives affect task choice 
automatically, then Ss who are paid for suc- 
cess only should be more likely to choose 
easy tasks as the payment for success in- 
creases, regardless of their intentions. On the 
other hand, if behavioral intentions mediate 
the effects of incentives on task choice, then 
incentive differences should produce task- 
choice differences only if and to the extent 
that they produce intention differences. 

The first two experiments deal with level 
of performance and the last three with choice. 
The first, fourth, and fifth experiments were 
designed and carried out by the first author; 
the second was designed and carried out by 
the second author; the third study was the 
responsibility of the third author. 


EXPERIMENT 1 


Method 
Subjects 
The Ss were 127 paid, undergraduate volunteers 
(71 males and 56 females) recruited from the Uni- 
versity of Maryland. All Ss received $3 for participa- 
tion in addition to whatever money (if any) 
they earned in the experiment. 


Task 


The task was giving uses for objects and was 
introduced as a test of creativity. On each trial, S 
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was given a common object (eg, a cardboard box), 
the task being to list possible uses for the object. 
There were two practice trials followed by three 
blocks of seven experimental trials each. Trials 
lasted 1 min, The Ss wrote their responses on answer 
sheets. During the experiment, Ss kept track of their 
own scores with the instruction that the protocols 
would be rescored after the experiment. The E 
did rescore them, eliminating only those responses 
that were not uses (eg, if S said “to sell” an ob- 
ject without telling why he sold it, it would not 
be considered a use), Thus, S's score was the total 
number of uses given without regard to quality, 


Procedure and Conditions 


After the rules were explained to Ss, they were 
given the two practice trials and told to do their 
best. On the first block of seven experimental trials, 
all Ss in Conditions A through E (see below) were 
told to “work at what you would consider to be 
a normal pace for an eight hour day, assuming no 
time limits and that you would not be paid for 
production.” This block was of no direct relevance 
to the present study and will not be discussed 
further. The experimental conditions were intro- 
duced before Blocks II and III. The Ss in each con- 
dition were run together. 


Condition A: Assigned goals; all-or-none incen- 
tive (V 19). Before Block II, Ss were assigned 
the goal of attaining, on each trial in that block, 
a score equal to one more than their mean score 
on the two practice trials. They were offered a S¢ 
bonus for each trial on which they reached this 
goal. 

Before Block III, Ss were assigned a goal of 
attaining, on each trial, a score equal to one more 
than their best score in Block II. They were offered 
a 10¢ bonus for each time they reached or ex- 
ceeded this goal. 

Condition B: Assigned goals; no incentive (N = 
17). This condition was identical to Condition A, 
except Ss were not offered any bonus for goal 
attainment. 

Condition C: Selj-set goals; piece-rate incentive 
(N = 22). The incentives were set for this group 
so that if they performed at the same level as 
Group A, they would receive, on the average, the 
same total pay on each block as Group A had 
received. Thus Ss were offered 6/10 of a cent per 
“use” on each trial on Block II and 4/10 of a cent 
per “use” on Block III. So that their goal level could 
be determined, Ss were told to indicate what their 
goal would be before each block. They were told 
that the goal would have nothing to do with their 
pay, which depended upon the number of uses 
given, but that they should indicate the goal (score) 
they were actually going to try for on each of the 
seven trials in that block. 

Condition D: No goal setting; piece-rate incentive 
(N=21), This condition was identical to Con- 
dition C, except that Ss did not set any explicit 
goals each block. 
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Condition E: Self-set goals; no incentive (N= 
12). This condition was identical to Condition C, 
except that no incentive for performance was offered. 

Condition A’; Assigned goals; all-or-none incentive 
(N = 21). This condition was similar to Condi- 
tion A above, except that the goals in this case 
were set higher than for Group A. On Block I the 
A’ condition Ss were told to do their best. Each 
S's Block II goal, for each trial, was one more than 
his best score on Block I, and his Block III goal, 
for each trial, was one more than his best score 
on Block II. The incentives were 10¢ and 25¢ for 
reaching or exceeding the goals on Blocks II and III. 
respectively. 

Condition B': Assigned goals; no incentive (N= 
15). This condition was identical to Condition A’ 
above, except that no incentives were offered for 
goal attainment. Thus this condition holds the same 
relationship to Condition A’ as Condition B holds 
to A above. 


All Ss were administered goal-description question- 
naires at the end of the experiment to determine 
whether they had been trying for their assigned or 
self-set goals and to what degree they had been 
trying for the incentives. The responses indicated 
that nearly all Ss were trying for their goals and 
were trying to maximize their earnings. 


Results and Discussion 


The data were initially analyzed using 
Groups A through E only. There were no 
significant performance differences among 
these five groups in raw means either on the 
practice trials (indicating the groups were of 
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equal initial ability) or on Blocks II and MI, 
However, the within-cells variance for these 
groups was inflated due to wide differences in 
ability of the Ss within each group. Thus 
subsequent analyses were done using only 
improvement scores: (a) from the practice 
trials to Block II, and (ö) from Block II 
to Block III. 

The performance means for each group for 
Blocks II and III are shown in Figure la 
and the mean goals for the same groups and 
blocks are shown in Figure 1b (no goals are 
shown for Group D, since they did not set 
explicit goals). In parentheses beside each 
point in Figure 1b are shown the percentage 
of times (trials) on which Ss in that group 
reached or beat their assigned or self-set goals 
on that block. It is evident that the higher 
the goals, the less likely Ss were to succeed 
in reaching them. 

The improvement scores from the practice 
trials to Block II showed no differences 
among the five groups. Since there were no 
differences among these groups on the practice 
trials, and since the goal levels of all groups 
on Block II were similar (see Figure 1b), 
this finding is consistent with Hypothesis 3. 

The improvement scores from Block II 
to Block III tell quite a different story, how- 
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Fic. 1a and b. Performance means and mean goal-level scores for Groups A 
through E, by block (Experiment 1). 


Errects or Monerasy Incentives ox Banava 


| 


eel iji nt 1 ijs 


1 


f e ail li i 


entries yielded 


44 221 231 
74234112 pn ine 2245 
e 
e 5 
$ 22212222 1112751175 i = 
gs E 21 . 1 ; T 
apananta 
F 


ance means and mean goal-level scores for A’ and B' 


. Perf 
Fic. 2a and b. P 2 by block (Experiment 1). 


110 


difference between the two groups either in 
goal-level changes or in performance im- 
provement from Blocks II to III. This 
replicates the similar findings for Group A 
versus B and Group C versus E, above. 

An estimate of the overall between-group 
relationship of performance level to goal level 
was obtained by ranking the 12 performance 
means plotted in Figures la (omitting Group 
D) and 2a, and the 12 mean goal scores plot- 
ted in Figures 1b and 2b. The rank-order 
correlation between these pairs of means was 
85 which is significant at the .01 level. The 
overall relationship between change in goal 
level and performance improvement from 
Blocks II to III was obtained using the 70 
Ss in Groups A, B, C, and E as the units of 
analysis. The cosine-pi estimate of r:e (di- 
chotomizing both variables at the median) 
was .55 which is significant at the .01 level. A 
similar relationship was not found among Ss 
in Condition A’ and B’, however. This is 
probably due to the fact that the range of 
goal improvement scores was less among these 
Ss. Improvement was harder for them since 
they were already at a high level of per- 
formance on Block II. As a result, the im- 
provement scores may have been less reliable 
for these Ss. 

The results of this experiment supported 
Hypotheses 1 and 3. Hypothesis 1 was sup- 
ported by the finding of a significant effect 
of goal level on performance level using 
both group means and individual change 
scores as the units of analysis. Hypothesis 3 
was supported by the finding that, among 
groups which had the same goal level, there 
was no effect of incentive on performance. 
The same goal level produced the same level 
of performance regardless of whether incen- 
tives were offered for goal attainment or not. 
Hypothesis 2 could not be tested in this study 
since there was no initial effect of incentive 
on performance. 

The next study was also concerned with 
level of performance, but was much simpler 
in design. It was decided to compare the 
effects of a piece-rate incentive with those 
of a time-rate incentive. It will be recalled 
that Toppen (1965c) previously found the 
former to produce a higher level of perform- 
ance than the latter. 
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EXPERIMENT 2 


Method 
Subjects 


The Ss were 30 paid, male, undergraduate volun- 
teers recruited from the University of Maryland. 
All Ss were run individually. The Ss were paid as 
indicated below. 


Task 


The task was assembling a number of identical 
objects from Tinker Toy sets. The object to be 
copied resembled a wind vane and was designed 
by the second author to insure a maximum utiliza- 
tion of all the Tinker Toy parts. The S was 
supplied with enough parts to make about 26 
wind vanes, and E could take some apart if S 
made more than this in the time allowed. Boxes 
of like parts were placed in front of S to make 
assembly easier. The S had to make a complete 
wind vane before beginning another (no assembly 
lines were permitted). 


Procedure and Conditions 


The study was introduced as an experiment on 
the development of “manual dexterity” and “manip- 
ulative abilities.’ The S was shown the model and 
asked to make two wind vanes just like the model 
in as little time as possible. The model was placed in 
front of S and he could refer to it as needed. The 
E recorded the time taken to make each vane. At 
this point the experimental conditions were in- 
troduced. 

Piece-rate incentive. Half the Ss were told that 
they would get no basic (minimum) pay but would 
receive 12¢ for every vane they put together in the 
next 50 min. This rate was chosen to allow Ss to 
make $3, on the average, in the 50-min. time limit. 
The S was given his fastest assembly time (of the 
two practice vanes) and told how many he could 
expect to make in 50 min. if he worked at that 
same pace. He then had to indicate to E the num- 
ber of vanes he would try for and the amount of 
money he would try to make. 

During the experiment, E left the room but could 
observe S through a one-way mirror. The E 
recorded the number of vanes completed every 
5 min. and told S how much time had elapsed 
every 10 min. At the halfway (25 min.) point, 
S was given a 2-min. rest while E counted 
the completed vanes in front of S and asked 
what his final goal would be; that is, S set a new 
final goal. fe 

Hourly rate incentive. The procedure for this 
group was basically the same as for the piece-rate 
group, except that Ss were told they would get $3 
for participation regardless of output. The Ss made 
goal ratings and were given feedback as with the 
piece-rate group. 


Results and Discussion 


The two incentive groups did not differ 
significantly in initial ability as measured by 
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Fic. 3a and b. Performance means and mean goal level for incentive groups, 
by period (Experiment 2). 


the speed with which they completed the 
fastest of their two practice models. Per- 
formance for the two groups during the first 
and second 25-min. periods of the experi- 
ment, measured in terms of the total number 
of models completed, is shown in Figure 3a. 
The goal levels of each incentive group for 
the first and second 25-min. periods are 
shown in Figure 3b. Since Ss set their initial 
goals in terms of total output (performance 
for the whole 50-min. period), the first-period 
goals were obtained by dividing their initial 
total performance goals by two. Second- 
period goals were obtained by subtracting 
each S’s first-half performance score from 
his revised total output goal (made just be- 
fore the second half of the experiment began). 
Since this revised total goal was made in 
full knowledge of first-period performance, it 
necessarily included S’s second-half perform- 
ance goal. It is clear from Figures 3a and b 
that Ss thought they could do more and 
actually did more in the second half than in 
the first half. 

The percentages of times the Ss in each 
incentive group beat their Period 1 and 2 
goals are shown in parentheses in Figure 3b. 
Note that within periods, higher goals are 
associated with fewer successes, whereas 
across periods higher goals are associated with 
greater success. The latter finding is no doubt 


a result of learning: as Ss became more 
proficient in the second period, they were 
able to set higher goals and to reach these 
goals more often as compared with the first 
period. 

It appears from Figure 3a that the piece- 
rate group completed more models during 
each of the two periods than did the hourly 
group. However, t tests on these two pairs of 
means showed these differences to be well 
within the range of chance expectancy. 
Similarly, the piece-rate group, as shown in 
Figure 3b, appeared to set higher goals for 
themselves in both periods than did the 
hourly group, but neither of the differences 
exceeded chance level. A ż test on the differ- 
ence of the changes in productivity from the 
first to the second period showed no difference 
between the two incentive groups, nor was 
the difference in goal changes significant. 

The Ss in both incentive groups were com- 
bined in order to compute correlations be- 
tween ability, first- and second-half per- 
formance, and first- and second-half goals. 
These correlations are shown in Table 1. All 
of the first-order correlations between the 
experimental variables were significant at the 
01 level. Although first-half goal correlated 
significantly with first-half performance, when 
ability was partialed out the significant rela- 
tionship disappeared (the partial correlation 
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is shown in parentheses). Second-half goal 
was significantly related to second-half per- 
formance, and the relationship remained sig- 
nificant even with first-half performance 
partialed out (r = .57, p < .01; see Table 1). 
Evidently, at the outset the Ss had not had 
enough experience with the task to base their 
goals on anything other than their initial 
ability scores, whereas their goals for the 
second half were set more in accordance 
with their motivation. 

This relationship between second-half 
goal and second-half performance can be 
demonstrated in another way. The Ss were 
regrouped according to the size of the differ- 
ence between their second-half goals and their 
first-half performance. The 15 Ss setting the 
highest goals for the second half in relation 
to their first-half performance were compared 
with the 15 Ss setting the lowest second-half 
goals in relation to their first-half perform- 
ance. The difference in the mean discrepancy 
between the two selected groups was sig- 
nificant at the .01 level (¢ = 6.92), indicating 
that the division was successful. The first- and 
second-half performance of the two dis- 
crepancy groups is shown in Figure 4. The 
difference in performance improvement from 
the first period to the second period was 
significant at the .05 level (t= 2.68) in 
favor of the high-discrepancy group. 

The results generally support Hypotheses 1 
and 3. The first hypothesis was supported by 
the significant relationships found between sec- 
ond-half goal level and second-half perform- 
ance for all Ss combined and by the sig- 
nificant difference in performance improve- 
ment of the two discrepancy groups. Hy- 


TABLE 1 


CORRELATIONS BETWEEN GOALS AND PERFORMANCE FOR 
ALL SUBJECTS COMBINED 


Goal r 8800 Performance, | Performance, 
half | half first half second half 
Ability 718 7¹ 79 60 
Goal. 
first half 71 -63 (.16b) 62 
oal, 
second half 193 -90 (.57¢) 
Performance, 
first half 85 


Note.—Experiment 2; N 30. 

* All r’s over .46 are significant at $ <.01, 
b With ability partialed out. 

° With first-half performance partialed out, 
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Fic. 4. Performance means for discrepancy groups, 
by period (Experiment 2). 


pothesis 3 was supported by the finding that 
when the goal level of the different incentive 
groups was the same (not significantly differ- 
ent), the effect of incentive condition on 
performance was not significant. Once again, 
the incentives failed to produce any initial 
performance differences so that Hypothesis 
2 could not be tested. 


EXPERIMENT 3 


Method 
Subjects 


The Ss were 22 Trenton State Teacher’s College 
females who were members of an introductory 
Psychology class. The Ss volunteered for the experi- 
ment and were each paid $1 for participation at the 
end. However, they were told at the outset that 
they would get only what they earned during the 
experiment. 


Task and Procedure 


The task was word unscrambling. There were 
6 practice trials followed by 3 blocks of 10 experi- 
mental trials each. On each trial, S$ was allowed to 
choose the length of word she would try to un- 
scramble; & could choose any word length from 
3 to 11 letters each time. The S had to make her 
choice before actually seeing the word but was 
told that words of the same length were equivalent 
in difficulty, After making her choice, § turned the 
page in her work booklet and worked on the word 
length she had chosen for 45 sec. On the first block 
of 10 trials, Ss were not offered any incentive. On 
the second block, they were offered 2¢ for each 
word correctly solved regardless of the length of the 
word. On the. third block of 10 trials, Ss were 
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offered 10¢ for each word correctly solved regard- 
less of the length of the word. 

Before each block of 10 trials and before each 
trial of each block, S described her behavioral in- 
tention for that block or trial by indicating, on a 
5-point scale, whether she would choose “the easiest 
word(s) possible,” “fairly easy word(s),” “words 
that are neither too easy nor too hard,” “fairly hard 
word(s),” or “the hardest word(s) possible” on that 
block or trial. The intention scale filled out before 
each block of words will be called the B-intention 
(for block) scale. The intention scale filled out be- 
fore each trial will be called the T-intention (for 
trial) scale. 


Results and Discussion 


The S’s intention ratings were quantified 
on a 5-point scale (5 for the “hardest words” 
alternative, to 1 for the “easiest words” al- 
ternative). The dependent variable was the 
length of the word, in letters, chosen by S 
on a given trial or the mean word length 
chosen on a given block. 

The mean within-S correlation (using 
Fisher z transformations for averaging) be- 
tween T-intentions and word-length choice 
across the 30 trials was .81 (range = .12—.94; 
Mdn=.80). The mean within-block, be- 
tween-Ss correlation between B-intentions 
and mean word choice was .60 (p < .01). 
The corresponding mean within-block correla- 
tion of mean word choice with the mean 
T-intention was .77 ($ < .01). B-intentions 
and mean T-intentions correlated .69 (p< 
.01), on the average, within blocks. 

The mean word-length choices and inten- 
tion ratings of the Ss fu each of the three 
blocks and the f ratios for enge between 
blocks are shown in Table 2. Ck rly, the 
higher incentives led to easier word choices. 
The proportions indicating the nun ber of 
trials out of 10 on which Ss succeeded in 
solving their chosen words were .60, .62, and 
71 for Blocks I, II, and III, respectively. 
The changes in word-length choice were ac- 
companied by equivalent changes in T-inten- 
tion ratings. The B-intention ratings did not 
completely parallel those for word choice and 
mean T-intentions, but were in the same 
direction. 

Across-block correlations between B-inten- 
tions, T-intentions, incentive, and word choice 
were obtained by treating the data from each 
block as though they were from different 


TABLE 2 


Mans axp $ Rarios ror Cuaxors iw Ixrexnoxs 
axp Worp-Lexorn Cuoice 


— z * — = = 


Means 
| 
Change 2 5 1 
| Block I | Block II | Block III 
(Oe) RA | (109 
B-intentions 291 2.68 2.45 
T-intentions 3.20 2.84 2.35 
Word- length choice 6.32 5.68 5.08 
t ratios for mean change 
Blocks Blocks Blocks 
1 II-III I-m 
B-intentions 131 105 2.35° 
T-intentions 3.34" 3.4” 5.30" 
Word- length choice | 3.98** 3.03" 5.11% 
ee 3; N = 22. 
* <0. 


individuals (thus, the V for observations was 
3 * 22 or 66). The incentive was coded 0, 1, 
or 2 (for the O¢, 2¢, and 10¢ conditions, 
respectively). This gave maximum predict- 
ability to the incentive variable since the 
mean word-choice changes were about equal 
between Blocks I and II, and II and III, 
respectively (see Table 2). The across-block 
correlations between B-intentions, mean T-in- 
tentions, incentive, and mean word choice are 
shown in Table 3. Also shown are the in- 
tention-word-choice correlations with in- 
centive partialed out and the incentive-word- 
choice correlations with intentions partialed 
out. For purposes of significance testing, an 
N of 22 was used in order to be conserva- 
tive (in view of the fact that the 66 observa- 
tions were not independent). The results in- 
dicated that both B-intentions and T-inten- 
tions were significantly correlated with word 
choice both before and after amount of in- 
centive was partialed out. In contrast, while 
amount of incentive was related significantly 
to word choice before partialing, the relation- 
ship was vitiated when T-intentions were 
partialed out, or when both B-intentions and 
T-intentions were partialed out. 

The results of this experiment support all 
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TABLE 3 


RAW AND PARTIAL CORRELATIONS OF INTENTIONS AND INCENTIVE WITH Worp CHOICE 


| 


Word choice 


ees oy Incentive | Partial out 
| | | Rar ] -AN 1 
} 8 3 1 P 5 - ani 

| Incentive | B-intentions | T-intentions Tic tentions 
B-intentions 60˙ — 24 62°* 59% — — 
T. intentions —.50* 84 79 — — — 
Incentive —.81˙ — —.48* — 21 —.22 


Note. Experiment 3, N = 660bs; 22 Ss. 


à Significance levels are based on N = 22. A negative correlation indicates the higher the incentive, the shorter the word 


lengths chosen or the “easier” the intention. 
< .05. 


p <0. 


three of the hypotheses described earlier. 
(1) Intentions were related to choice within 
individuals, between individuals within-blocks 
(where incentive was constant), and between 
individuals across blocks (where incentive 
varied). (2) When incentive differences did 
produce choice differences, these differences 
were accompanied by equivalent differences 
in T-intentions (though not necessarily in 
B-intentions). (3) Finally, when across-block 
differences in intentions were partialed out, 
there was no relationship of amount of in- 
centive to choice. 

The next experiment was similar in design 
to this one except that only the B-intention 
scale was used and only two rather than 
three incentive conditions were included. In 
addition, on the O¢ block in the next experi- 
ment, different instructions were given to 
different Ss and their effects on performance 
and intentions observed. 


EXPERIMENT 4 


Method 
Subjects 


The Ss were 70 University of Maryland, paid, 
undergraduate volunteers. Twenty-seven Ss were 
female and 43 were male. All Ss received $3 for 
participation plus whatever they earned in the 
experiment, 


Task 


The task was word unscrambling as in the previ- 
ous experiment. In the present case, there were 3 
practice trials followed by 2 blocks of 10 experi- 
mental trials each. Again S could choose any word 
Jength from 3 to 11 letters on each trial. The Ss 


chose their word length before actually seeing 
the word, but were told that all words at any 
given length were of equivalent difficulty. 


Procedure and Conditions 


Block I. The Ss were divided at random into 
three instructional groups on Block I. Twenty-three 
Ss were told to “try and achieve as much success 
as possible”; 23 Ss were told to “try and achieve 
as great a sense of personal accomplishment or 
personal achievement as possible”; and 24 Ss were 
told to “try and overcome the greatest possible 
challenges (even if you do not often make it).” 
No incentive was offered for success on this block. 

Before the first experimental trial, Ss filled out 
$-point behavioral intention scales, similar to those 
used previously, to indicate their intended word- 
choice difficulty for the next 10 trials (possible re- 
sponse categories ranged from “very easy words” 
to “very hard words”). 

After this, Ss worked on 10 different word choices 
for 48 sec. each, choosing whatever word length 
they wanted before each trial. The Ss were told 
that they were not committed to their intention 
ratings. 

Block II. On Block II, all Ss were treated 
identically. All were offered 4¢ for each word cor- 
rectly solved in this block regardless of the length 
of the word. Before this block, Ss filled out be- 
havioral intention scales; they also indicated how 
much money they would try to earn on the coming 
plock (level of aspired earnings). Again, Ss worked 
for 45 sec. on each word choice, selecting whatever 
word length they wanted before each trial. 


Results and Discussion 


The behavioral intention ratings were 
quantified on a 5-point scale (5 for the “very 
hard words” alternative, to 1 for the “very 
easy words” alternative). The Block I in- 
structions were quantified on a 3-point scale 
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(5 for “challenge,” 3 for “achievement,” and 
1 for success). The latter quantities were 
based on the authors’ a priori notion of the 
difficulty of the choices these instructions 
would lead to if Ss followed them. The 
dependent variable was the mean word length 
chosen on a given block. 

In Block I, instructions correlated 67 
(p < .01) with mean word choice, while in- 
tentions correlated .88 (p < .01) with word 
choice. The correlation between intentions 
and word choice with instructions partialed 
out was .76 (p< 01); on the other hand, 
the correlation of instructions with word 
choice with intentions partialed out was only 
08 (ns). Block II intentions correlated .81 
(p< .01) with word choice on that block, 
while aspired earnings correlated .60 (p < 
01) with the latter variable. 

Figures 5a-c show the mean total word- 
length choices and mean intention ratings 
on each block, for the Ss in each Block I 
instruction condition (‘success,” “achieve- 
ment,” and “challenge”). Looking first at 
Figure 5a, the group trying for “success” 
on Block I chose easy words on that block 
(mean empirical probability of success = 76) 
and did not change their word choices sub- 
stantially on Block II. The intention-level 
means for these Ss are also similar for Blocks 
I and II. Neither the word choice nor the 
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The ¢ ratios for mean between-block changes 
were significant at the 001 level for both 
the word-length and intention-level measures 
(t's = 4.87 and 4.83, ely). 

The corresponding data for the 


(mean empirical probability o 
33) and much easier words on Block II. The 
between-block mean changes in word-length 
choice and intention level were both signifi- 
cant at the .001 level (¢’s = 7.73 and 9.72, 
respectively). 

The effect of controlling for intention level 


choice and intention level as a function of in- 


centive and instruction condition (Experiment 4). 


116 


separate plots possible for them). The curves 
show a marked effect of intention level on 
word choice, but little effect of incentive on 
choice. None of the incentive effects is sig- 
nificant within any intention level. 

The effects of controlling for intention can 
also be shown by means of partial correla- 
tion techniques. For this purpose, the inten- 
tion, word-choice, and incentive scores from 
each block were treated as though they came 
from different individuals (making an N for 
observations of 2 X the number of individuals 
involved). The incentive variable was coded 0 
for Block I (where the incentive was 0¢) and 
1 for Block II (where the incentive was 4¢). 
Thus, the correlations of incentive with the 
other variables were point-biserial coeffi- 
cients.* The correlations of intention and in- 
centive with word choice, and relevant partial 
correlations, are shown in Table 4 for each 
Block I instruction group and for all Ss 
combined. 

The results show that intention was sig- 
nificantly related to word choice both before 
and after incentive was partialed out, but 
that incentive was not related to word choice 
when intention was partialed out. This find- 

It was believed that a biserial coefficient was not 
justified in this case, since the incentive classifica- 
tion did not represent coarse grouping but the 
real value of the two incentive conditions (0¢ and 
4¢). In addition, the results of Experiment 5 in- 
dicated that the difference between “some incentive” 
and “no incentive’ was considerably greater than 
the difference between different amounts of in- 
centive (eg, 1¢ vs. 2¢). This implies a genuine 
underlying dichotomy. 
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Mean Total Word Length Choice t h 


Fic. 6. Word choice as a function of amount of 
incentive, for three levels of intention (Experiment 4). 


ing was true for each Block I subgroup as 
well as for all Ss combined. 

Like the previous experiment, these results 
support all three of our hypotheses: (1) in- 
tentions were significantly related to choice 
both within and across blocks; in addition, 
level of aspired earnings was correlated with 
choice on Block II where money was offered 
for success; (2) significant incentive effects 
on choice were accompanied by corresponding 
differences in intentions; (3) when the effect 
of intention level was controlled or partialed 
out, there was no relationship of amount of 
incentive to choice. A further finding of in- 
terest was that intentions predicted perform- 
ance better than instructions. When Block I 
instructions were partialed out, there was still 
a significant relationship of intention to word 


TABLE 4 


CORRELATIONS Or INTENTION LEVEL AND Amount oF INCENTIVE WITH MEAN Worp-LENGTH CHOICE, 
ACROSS BLOCKS 


Correlation of mean total word-length choice with 


N 
Subjects r Intention level Amount of incentive 
partial incentive Tpbis Tpbis, partial intention Subjects Observations 
Success 782 78¹ —.08* —.10 23 46 
Achievement 81** JA —.51* —.13 23 46 
Challenge 93** 8277 —.76%* 01 24 48 
All subjects 89 * .86** — 49h" —.10 70 140 


Note.—Experiment 4. 
Negative sign indicates 


the higher the incentive the easier (shorter) the word-length choice, Since there were only two 


values, of meente O¢ and 4¢, r’s for incentive are point - biserial coefficients, 


05 (52884 on N for Ss). 
* 01 (based on N for Ss). 
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choice on that block, but there was no rela- 
tionship of instructions to choice when inten- 


relations, it could affect 
in choice. Second, in both previous studies 
there were five intention 
two or three levels of 
might be argued that incentives 
a “fair shake“ because 
to vary than did intentions. 


ment on choice was to remedy these 
difficulties. à 
EXPERIMENT 5 
Method 
Subjects 


The Ss were 64 University of 
undergraduate volunteers. Thirty 
and 34 were male. The Ss received 
tion plus whatever they earned in the 
(it anything). 


Task 


Fifty Ss 
on each of the five blocks, 
different incentive, either 1¢, 
for each word correctly 
less of the length of the 
five conditions, but the 
incentive was coun! 


each block of five trials, an instruction sheet in- 
formed Ss of what their incenti would be for the 
coming block. They then indicated on a S-point be- 
havioral intention scale (like that used in the 
previous experiment) what their intended choice 
difficulty would be for the coming block of five 
trials, Next, Ss made their first word choice and 
worked on their chosen word length for 45 sec. 
before making their choice for the next and 
so on. The Ss were told that they were not 


Fourteen Ss were i 
These Ss were not 


group curves with 
curves indicate that 

Ss were different. There was a slight but sig- 

nificant tendency within the 

choose casier words when 


A similar effect: was 
vari- 
3.00; p < 05; dj = 4/196). There 

difference between tbe 


of the 
tive Ss, with the latter choosing harder words 
(t=4.13, P< 01). A similar difference was 
found in the mean intention scores of the 
variable-incentive and no-incentive groups 
(t= 3.33, p < 01), with the latter having 
“harder” intentions. The proportions of suc- 
cess in solving the chosen word were 
'52, .68, 67, .71, .79, and .76 for the OF, 16. 
26, 36, 46, and 5¢ incentive conditions, 


y- 

The effect of controlling for intention level 
is shown in Figure $ where word choice is 
plotted as a function of incentive for each of 
three levels of intention. The intention levels 
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correspond to the three “easiest” alternatives 
on the intention scale, except that the top 
category includes all intentions of 3 or higher. 
There was a marked effect of intention level 
on word choice, but very little effect of in- 
centive within any given intention level. 
There was a tendency for no-incentive Ss 
with intentions of 3 or more to choose harder 
words than variable (1¢ to 5¢) incentive Ss 
with the same intentions (f = 3.15, p < .01), 
but the £ ratio may have been inflated due to 
the fact that the observations within each 
group were not independent. (Since the unit 
of analysis was the block, rather than the 
individual, many Ss appeared more than once 
in the analysis.) 

Correlations between amount of incentive, 
intentions, and word choice were obtained 
using independent observations by the fol- 
lowing procedure: for each S, his intention, 
mean word choice, and corresponding in- 
centive scores were chosen for just one of the 
five blocks that he worked on. The data were 
chosen so as to equalize the representation 
of each of the five incentive levels and so as 
to control for order effects. The same pro- 
cedure was used to select data from the no- 
incentive condition, except that only order 
had to be controlled. Thus, the correlations 
between intentions, incentive, and word choice 
are based on only one-fifth of the total data, 
but no S was entered twice in the analyses 
as in the previous two experiments. The rele- 
vant raw and partial correlations are shown 
in Table 5, for the variable-incentive group 
separately and for the variable- and no- 
incentive groups combined. 


Mean Total Word Length Choice (5 Trials) 
Mean Intention Level Rating 


Incentive 


Fic. 7, Mean word choice and intention level as a 
function of incentive (Experiment 5). 
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Fic. 8. Word choice as a function of amount 
of incentive, for three levels of intention (Experi- 
ment 5). 


The correlations between intentions and 
word choice were highly significant (p < .01) 
both before and after partialing out incentive. 
In contrast there was very little relationship 
of amount of incentive to word choice before 
partialing out intentions, and no relationship 
after partialing intention level. 

These findings clearly support those of 
the previous two studies, even with order 
effects controlled and using six different in- 
centive levels: (a) there was a significant 
relationship between intentions and choice 
both within and between incentive conditions; 
(Y) significant incentive effects on choice were 
accompanied by corresponding mean inten- 
tion differences; and (c) when intention 
level was partialed out, there was little or no 
effect of incentive on choice. 

A further finding of interest in this study 
emerged from the analysis of Ss’ postex- 
perimental descriptions of their general in- 
tentions in the experiment. Two independent 
raters rated each intention description on 4 
1-5 scale according to the choice difficulty it 
suggested (e.g, an intention “to make as 
much money as possible” would get a score 
of 1, while an intention to “get a sense of 
accomplishment” would be given a score of 
3). The correlation between the score as- 
signed by the two raters was .90, for those 
51 protocols to which both raters felt con- 
fident enough to assign scores. To obtain 
summary scores for these Ss, the two scores 
were summed. Summary scores were obtained 
for an additional 8 Ss, whose ratings were 
only assigned scores by one rater, by dou- 
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TABLE 5 


CORRELATIONS OF ÎNTENTION LEVEL AND Auooxr or Incentive wrin Miran 
Worp-Lexoru Cnoice, c, BLOCKS 


Correlation of mean total word-length choice with i 


Subjects Intention level Amount of incentive | N 


— E 
Subjects in 1-5¢ condi- 


w 
tion only 
All subjects (1-S¢ and no- “A 


incentive condition 
combined) 


Note.—Experiment 5. 
> Negative sign indicates the higher the incentive the easier (shorter) the word-iength choice. 


* <.01. 


bling the latter score. For these 59 Ss, sum- within and/or between different monetary 

mary postperformance intentions correlated incentive conditions. Further, in each of the 

68 (p < 01) with summary preperformance five studies it was demonstrated that if goal 

intentions (the latter were obtained by sum- or intention level was controlled or partialed 

ming the five intention ratings Ss made out, there was no effect of amount of in- 

during the experiment). Summary preper- centive on behavior. This was demonstrated 

formance and postperformance intentions in the studies of performance level by show- 

correlated .83 (p < -01) and .68 (p < 01), ing that the same goal level produced the 

respectively, with mean word-length choice same performance level regardless of whether 

on the five blocks combined. incentives were offered for performance or 

The former correlation was significantly not. The same fact was demonstrated in the 

greater than the latter (¢ for nonindependent studies of choice by showing that when be- 

rs = 2.60, p < 05), indicating that the post- havioral intention level was partialed out, 

performance measures were poorer pre- the initial effects of incentives on choice 

dictors” of actual behavior than preperform- Were vitiated. Finally, the three studies of 

| ance measures. This would seem to con- choice demonstrated that when incentives 

tradict the claim frequently made by be- did affect behavior, these behavior differences 

~  havioristically oriented investigators that cor- were associated with equivalent differences in 
relations between postperformance measures behavioral intentions. 

and performance are spurious because the The three studies of choice yielded a more 

former are only descriptions (or rationaliza- complete test of the theory than did the 

tions) of behavior. If this were true, the post- two performance-level studies since the de- 

performance measures should be at least as pendence of incentive effects on intention 

highly related to behavior as preperformance differences could be demonstrated directly. 

measures, if not more 50. However, the fact that monetary incentives 

did not show any initial effect in the per- 

ConcLUSION formance-level studies suggests the lesser 


The results of the five studies reported potency of these incentives as compared with 


here are consistent with the hypothesis that performance goals. hae 
goals and intentions are important determi- The first study suggests the possible im- 
nants of behavior, and with the hypothesis portance of the manner in which incentives 
that they are the mechanism by which are arranged. All-or-none incentives made 
monetary incentives influence behavior. All contingent upon reaching à given performance 
| five studies showed significant relationships level were more effective, in this case, than 
| between goals or intentions and behavior the same payment rate made on a piece-rate 
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system. The reason seems clear: the goals 
Ss set themselves under the piece-rate sys- 
tem were lower than those that were assigned 
under the all-or-none system. There is no 
reason to believe that this will inevitably be 
the case, but it does point up the potential 
importance of the manner in which incentive 
systems are structured. Essentially, a piece- 
rate system lets the individual set his own 
overall work and pay goals, while an all-or- 
none payment system requires that he reach 
a specific level of performance before he gets 
anything. 

Interestingly, back in 1911, Taylor im- 
plicitly recognized the potential difference 
between the two types of incentive system as 
well as the relationship of pay to goal setting. 
Under what Taylor called the traditional 
system of management by “initiative and 
incentive” the worker set his own pace, chose 
his own goal, determined his own work 
methods, and was paid accordingly, typically 
on a piece-rate system. Taylor’s scientific 
management system introduced, among other 
things, performance standards or specific 
work goals for workers. Payment of the bonus 
(or the higher rates of pay which accom- 
panied the introduction of his system) was 
contingent upon reaching assigned work goals. 
This goal was inevitably higher than the level 
of performance which had previously existed. 
Employees were told that they were expected 
to earn so much money (which meant pro- 
duce so much) if they wished to remain on 
the job. Taylor called these “high-priced 
men.” 

Opsahl and Dunnette (1966) recognize the 
possible importance of goal setting in arguing 
that it might be beneficial to integrate pay- 
incentive systems with goal setting in indus- 
try, for example, by having pay increases 
dependent upon fulfillment of work goals (p. 
104), or by having individuals actually set 
pay goals. 

It may be hypothesized that assigned all- 
or-none goals or goals formally agreed upon 
by management and the worker will be more 
effective than self-set goals only to the extent 
that: (a) the former goals are set at a 
higher level than the latter, and (b) the 
former procedure results in a greater or more 
intense commitment of the individual to the 
goal. 
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This brings up the problem of the 
specific psychological role that monetary in- 
centives play in motivation. Although such 
incentives appear not to influence perform- 
ance directly, they may serve a number of 
other important functions: 

1, Incentives can influence the nature of 
the intentions one develops on a task. If the 
payoff for succeeding at a hard task is large, 
the individual may be more likely to set a 
hard goal and take a big risk than if the 
payoff is small; or if the payoff is greater for 
succeeding at easier tasks (as in Experiments 
3-5), this may lead him to establish easier 
intentions and to choose easier tasks. 

2. Monetary incentives may persuade an 
individual to accept goals or tasks that are 
assigned to him. In Experiment 1, the 
“demand characteristics’ (Orne, 1962) of 
the experimental situation were sufficient to 
persuade the assigned-goal, no-incentive Ss 
to accept their assigned goals, but such co- 
operation could not be taken for granted in 
most industrial situations. Workers will 
generally not accept a work goal or task 
unless it achieves other valued goals, such as 
to make money. 

3. Monetary incentives may affect the 
degree of commitment of an individual to his 
goal or behavioral intention, Commitment 
may be expected to influence the degree of 
persistence an individual will show in the face 
of difficulty and frustration, the degree to 
which he will retain the goal if it conflicts 
with other goals (e.g., to be liked by co- 
workers), and the probability of his abandon- 
ing the goal altogether and “leaving the 
field” in the face of alternatives. 

It must be stressed that whatever the 
effects of monetary incentives on performance, 
their ultimate impact should be a function of 
the degree to which the individual values 
money as compared to other incentives and 
his perception of the degree to which a given 
course of action is seen as a means of attain- 
ing this value (i. e., the perceived instrumen- 
tality of behavior in gaining the value; 
Vroom, 1964). 

While the present research did not ex- 
plore the evaluation process itself, the find- 
ings did indicate that attention should be 
paid to the goals and intentions which result 
from it. Many incentive programs are 
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based on the implicit assumption that incen- 
tives will affect behavior more or less auto- 
matically. The present findings show such an 
assumption to be vastly oversimplified and 
highly misleading, even in the context of 
laboratory experiments in which there is a 
high degree of control over the situational 
variables. 
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EFFECTS OF CHANGES IN JOB-SATISFACTION 
LEVELS ON EMPLOYEE TURNOVER 


CHARLES L. HULIN * 


University of Illinois 


The results of a program designed to increase the job satisfaction and decrease 
the turnover rate among a large group of female clerical workers are presented 
Results indicated that levels of satisfaction were increased, with the largest 
increases occurring in the satisfaction variables stressed in the program. A sig- 
nificant decrease in turnover (from 30% to 12%) was also observed. Several 
alternative explanations for increased satisfaction and decreased turnover were 
considered, due to the lack of a completely adequate control group. None was 


considered adequate to explain the particular pattern of results obtained. 


Previously reported data (Hulin, 1966) 
indicated the presence of an abnormally high 
turnover rate among the female clerical 
workers employed by the corporate offices of 
a large Canadian manufacturing firm. The 
turnover rate among the female employees of 
this company (30%/yr) was not only high 
on an absolute basis but was high relative 
to 15 other large firms located in Montreal 
who had a mean annual turnover rate of only 
20%. The company involved ranked either 
first or second in turnover rate during each 
of the last 10 yr. among this group of com- 
panies. Many of the ecological factors which 
contribute to high turnover rates (general 
economic conditions of the area, labor market 
conditions, competition from other firms) and 
the characteristics of females in the work 
force (age, marital status, previous experience, 
domestic responsibilities, pregnancy, mar- 
riage) were uncontrolled but were common 
to the other firms in the area who had much 
lower turnover rates. Therefore, considera- 
tion was given to the role job satisfaction and 
dissatisfaction played in contributing to the 
high turnover rate. 

The relationship between job satisfaction 
and turnover was studied (Hulin, 1966). 
Job-satisfaction questionnaires were adminis- 
tered to 345 female clerical workers employed 
at this company. The questionnaire measured 
satisfaction with six aspects of work. A year 
after the questionnaires were administered, 

1The writer would like to thank James Anderson 
and Susan Pfeifer for their assistance on the data- 
analysis stages of this research and Milton Blood 
who read and commented on an earlier draft of the 
paper. 


43 of the girls had quit who had completed 
the questionnaires and who could be identified 
on the basis of birth date, department, lan- 
guage, and marital status. A number of girls, 
who quit during the year following the survey, 
had started work after the survey in July 
1964, a number did not report sufficient 
biographical information on the question- 
naire so positive identification was possible, 
and some had changed departments. 

For each of the 43 terminators, 2 control 
Ss were drawn from the sample of girls who 
had remained on the job. These controls were 
selected so they matched the terminators on 
age, years of education, and mother tongue. 
A comparison of the mean satisfaction scores 
of these two groups indicated the two groups 
were significantly different in terms of group 
centroids. Correlational analyses indicated a 
multiple correlation between the satisfaction 
variables and turnover of .34 (p .01) and 
a significant Pearson correlation between an 
unweighted sum of the satisfaction variables 
and turnover of —.27 (p < .01). 

On the basis of these results the question 
was raised as to whether the turnover rate 
for this company could be reduced by in- 
creasing the job satisfaction of the clerical 
workers. In order to answer this question a 
program was initiated by the company to at- 
tempt to increase job satisfaction in the hope 
of reducing turnover. It is not the purpose of 
this article to analyze the program per se. 
However, a brief description may be in order. 
The results of the scores on the Job Descrip- 
tion Index (JDI) indicated that dissatisfac- 


tion of the clerical workers was occurring in 
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all areas. (The JDI is a cumulative-point, 
adjective checklist type of scale. It has under- 
gone extensive validation research and has 
been described elsewhere—Quinn and 

1967, p. 456; Vroom, 1964, p. 100. The JDI 
was constructed to measure five 

aspects of job satisfaction: satisfaction with 
work done, with the pay, with the supervisor, 
with promotional opportunities and 
and with co-workers.) Analysis of their re- 
sponses to open-ended questions substantiated 
this finding and also indicated that their dis- 
satisfaction with pay was directed as much 
at the wage and salary administration as it 
was at wage levels. Examination of the work- 
ers’ responses on the satisfaction with promo- 
tions scale indicated that the girls felt dis- 
satisfaction was due to their feelings of being 
on “dead-end jobs” with little or no 
for promotion and not due to their 
about the fairness of the policies. These anal- 
yses led the company to revise their wage 
and salary administration policies to make 
them more nearly constant across departments 
and to institute regular salary reviews 
formalize the merit-raise procedure. Also a 


i 
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ing intracompany transfers so the girls could 
move from one department to another where 
they would have 
promoted. In conjunction with this latter 
procedure the girls were encouraged to make 
changes in their own job so that they could 
have a more responsible job within their 
present job classification. The most compelling 
reason for choosing these job aspects to con- 
centrate on in the program was because they 
are the job characteristics which are most 
fully under the control of the company. The 
entire program was put into effect in October 
1964. 

The purpose of the present article is to 
analyze the changes which have occurred in 
this company since July 1964 regarding satis- 
faction scores, turnover rates, and the relation- 
ship between satisfaction and termination 
decisions. 

METHOD 


Research Setting 


The company in which this study was carried out 
has been previously described (Hulin, 1966; Hulin 
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first week of July 1966. By the 
April 1967, 32 giris had quit, of whom 20 
identified. H the of 
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TABLE 1 
MEAN 1964 AND 1966 JDI SATISFACTION SCORES 
Mean 1964 | 1964 Mean 1966 1966 
Variable | scores | percentile scores percentile t 
(n = 345) | equivalents (n = 298) equivalents 
— a | 
Satisfaction with: | | 
Work | 35.33 44 36.11 47 .85 
Pay | 15.01 14 32.83 66 10:277% 
Promotions | 10.78 48 24.58 74 10.38** 
Co-workers 41.53 39 43.40 47 2.455 
Supervision 40.85 48 43.23 60 2.76** 


=p <.05. 
* < 01. 


30% in 1964 before the changes in the com- 
pany’s programs and policies were initiated. 
The turnover rate during 1965 was 18% in- 
dicating that there has been a steady decrease. 

As in the 1964 survey, mean satisfaction 
scores were computed for each of the five job 
areas. Table 1 shows these mean satisfaction 
scores as well as those obtained in 1964. 
t tests of the significance of the difference 
between the 1964 and 1966 means were 
computed. The results of these ¢ tests are 
shown in Table 1. Of the five job areas, 
four—pay, promotion, supervision, and co- 
workers—showed a significant increase in 
satisfaction from 1964 to 1966. Promotion 
and pay satisfaction showed the biggest in- 
creases. For both these variables the increase 
was significant beyond the .001 level. The 
mean satisfaction with work did not increase 
significantly from 1964 to 1966. While this 
analysis (multiple f tests done on correlated 
dependent variables) is not exactly correct 
and yields an underestimate of the «œ level for 
the results of the study, it is the most nearly 
correct analysis considering the hypothesis of 
the study. The concern was not with changes 
in the total set of dependent variables but 
with where the changes occurred. Further, 
a T° analysis, the multivariate analogue of the 
t test (Cooley & Lohnes, 1962), would not be 
appropriate either since the variance-co- 
variance matrices from the 1964 and 1966 
samples were not equal. This, of course, vio- 
lates one of the assumptions of the T? 
analysis. 

The columns headed “1964 and 1966 per- 
centile equivalents” present the percentile 


equivalents of mean scores based, of course, 
on the appropriate numbers of Ss. These per- 
centile scores were computed with reference 
to norms which have been constructed on the 
basis of approximately 650 female workers 
from the United States work force. 

For each of the 16 terminators who could 
be identified, 2 control Ss were drawn who 
were matched in terms of age, years of educa- 
tion, job level, and mother tongue. A linear 
multiple-regression analysis was then per- 
formed on this group of 16 terminators and 
32 control Ss. A linear multiple correlation 
of 45 (p < .05) was obtained. An unweighted 
sum of the five satisfaction scores was cor- 
related —.46 (rpis) ( < .01) with the turn- 
over. 

The multiple correlation of 45 and the 
Tpis Of —.46 predicting turnover indicate that 
even with turnover rates as low as 12% these 
termination decisions bear a relationship to 
the satisfaction of the workers. Further, the 
multiple R of .45 is not significantly different 
than the multiple R of .34 which was obtained 
in the 1964 analysis (Hulin, 1966). This 
indicates that a turnover rate of 12% in this 
setting does not represent the base rate which 
can be expected due to non-job-related 
variables. 


DISCUSSION 


The results of this study indicate that there 
was a decrease in turnover of 18% after the 
changes in the company’s pay and promotion 
policies were instituted. In addition, there 
was a significant increase in satisfaction with 
four out of five of the job areas. Considering 
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these results, one is tempted to conclude that 
the company program initiated in 1964 
brought about an increase in the job satisfac- 
tion of these female clerical workers, and 
that this increase led to a reduction in turn- 
over in 1966. However, since this was not a 
true experimental study (Campbell & Stanley, 
1966, refer to it as a “pre-experimental” 
design which has confounded extraneous vari- 
ables), several alternative explanations must 
be considered. 

One explanation which could account for 
the decrease in turnover without attributing 
it to increased satisfaction is history. A change 
in the labor-market conditions from 1964 to 
1966 may have occurred. It could have been 
much easier to get a job in 1964 than it was 
in 1966. If a girl was dissatisfied with her 
job, she could quit and easily find another 
job. However, an examination of the condition 
of the labor market for these 2 yr. indicates 
little change. One would expect that changes 
in the labor market would be reflected in the 
turnover rates of the other companies located 
in Montreal. The average annual turnover 
rates for the other 15 large companies in 
Montreal were 18.4%, 20.2%, and 20.0% in 
1961, 1962, and 1963. These turnover rates 
have not decreased during 1964, 1965, and 
1966. The average turnover rate among these 
15 companies was 24.3% in 1966. An explana- 
tion for the decreased levels of turnover based 
on changes in the labor market conditions 
can be rejected. 

One could explain the increases in job- 
satisfaction scores from 1964 to 1966 as a 
regression to the mean phenomenon. How- 
ever, this argument loses potency when the 
percentile equivalents of the 1964 and 1966 
satisfaction scores are examined. Only the 
nonsignificant change in work satisfaction 
(forty-fourth to forty-seventh percentile) and 
the change in satisfaction with co-workers 
(thirty-ninth to forty-seventh percentile) 
would fit this explanation. Futher, the change 
in turnover rate should not be attributed to 
a regression phenomenon since the turnover 
rate had been stable for over 10 yr. For the 
5 yr. preceding the study the turnover rate 
fluctuated between 30.0% and 31.3%. To 
argue that the rate suddenly regressed to 12% 
stretches the notion of statistical regression. 
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Another explanation which could account 
for the results of this study is that there were 
changes in the composition of the female 
clerical workers employed at this company 
from 1964 to 1966. A certain pattern of these 
employee characteristics might predispose the 
worker to remain on the job and also to be 
more satisfied with her job. This pattern could 
be more frequently found among the 1966 
clerical staff. If this explanation is correct 
then one would expect to find differences be- 
tween the composition of the 1964 and 1966 
clerical staff. However, a comparison of the 
compositions of the two staffs in terms of 
age, job level, years of education, and mother 
tongue revealed no such differences. 

It would also be difficult to postulate 
random factors which could explain the par- 
ticular pattern of results obtained. To argue 
that one set of uncontrolled factors accounted 
for the large increases (f ratios of 10.27 and 
10.38) in pay and promotion satisfaction, 
which were the two job aspects concentrated 
on in the program, and small increases in the 
three areas not stressed and that other un- 
controlled factors simultaneously led to de- 
creased turnover strains credibility. 

Other problems of internal validity resulting 
from this type of study design do not seem 
particularly relevant. Factors such as motiva- 
tion, instrument decay, mortality, or selection 
would not seem to account for the findings. 

The final remaining weakness of this design 
would be in the use of reactive measures 
(Campbell & Stanley, 1967; Webb, Campbell, 
Schwartz, & Sechrest, 1966). However, if the 
simultaneous reduction in turnover and in- 
crease in satisfaction are due purely to ob- 
trusive measurement in combination with an 
otherwise ineffective program, then the mil- 
lennium has indeed arrived. For if this ex- 
planation were true, then a combination of 
obtrusive measurement and any poorly de- 
signed, inappropriate program will solve turn- 
over problems. This does not seem reasonable. 

The major remaining question has to do 
with the relationship between job satisfaction 
and turnover under the conditions of reduced 
turnover rates. With a turnover rate of 30% 
in 1964, a multiple correlation of .34 was 
obtained. With a turnover rate of 12% in 
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1966, a multiple correlation of AS and a 
biserial correlation of —.46 were obtained. 
The expectation was that a lower relationship 
between satisfaction and turnover would be 
obtained with reduced turnover levels. These 
findings indicate that the difference in the 
multiple correlation is nonsignificant. Even 
with a turnover rate of 12% a significant 
amount of the variance in individual termina- 
tion decisions is attributable to differences in 
satisfaction. 
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ACHIEVEMENT VALUES, SOCIAL DESIRABILITY, 
AND ENDORSEMENT OF TRAIT NAMES ON 
THE BERDIE CHECK LIST 
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Using a forced-choice test of achievement motivation (v Ach), called the 
Sentence Completion Test (SCT), 100 St were classified into High v Ach 
and Low v Ach groups, All Ss took the Marlowe-Crowne Social Desirability 
scale and the Berdie Adjective Check List. It was found that Ss of High 
v Ach group more frequently endorsed achievement related adjectives of the 
checklist, The correlation between SCT scores and frequency of endorving 
achievement-related adjectives turned out to be significant even after the 


effects of social desirability and 


relation between the later 2 reponse- 
Implications of the results are discussed. 


The adjective checklist method has recently 
aroused some interest among workers en- 
gaged in personality research, The adjective 
checklist has some advantages over some 
other self-report techniques of personality 
assessment. It takes only a few minutes for 
most Ss to check the list even though it may 
contain as many as 200 items. A distinct ad- 
vantage of this method is that Ss usually are 
not reluctant to perform this kind of task. 
According to Gough (1955), the adjective 
checklist method has three advantages. As 
with the association technique, a wide seg- 
ment of personality can be readily measured. 
The familiar words in the list provide a 
meaningful task for the respondent. Lastly, 
the endorsement of various items assures 
analytic ease and may stimulate research 
workers to derive various scoring keys from a 
particular checklist such as Gough’s (1952) 
Adjective Check List (ACL). 

Heilbrun (1958) developed 15 rationally 
derived need scales for the ACL. In one of 
his validational studies (Heilbrun, 1959) he 
found that high-achieving Ss, as defined by 
relatively high grade-point averages (GPAs) 
with estimated intelligence held constant, en- 
dorsed more frequently those statements 
which reflected need for achievement. How- 
ever, in this investigation no attempt was 
made to control the social desirability factor 

1 The author would like to thank Chandra Kishore 
Pd. Sinha for collecting part of the data reported in 
this paper. 


acquiescence were partlakd out, The cer- 
set variables was found to be high 


(Edwards, 1957), although Heilburn (1958) 
himself had demonstrated earlier that fre- 
quency of endorsement of adjective need clus- 
ters is highly related to judged social desir- 
ability of the adjective characteristics of the 
need. Edwards (1953) has reported a high 
positive correlation (r = .87) between social 
desirability value of a trait name and the 
probability that the trait will be endorsed. 
Quite recently, there have been several 
empirical and theoretical papers directed 
specifically toward the problem of possible 
relationships between social desirability and 
endorsement of trait names of closely related 
self-descriptive statements from Q-sort pools 
and inventories like the Minnesota Multi- 
phasic Personality Inventory (MMPI). A re- 
view of such studies will be found in Edwards 
(1957). Jackson and Messick (1961) found 
that nearly three-quarters of the common 
variance and more than half of the total 
variance in scores on MMPI could be at- 
tributed to social desirability and acquies- 
cence. According to Edwards, Walsh, and 
Diers (1963), “The probability of a True 
response to a personality item is a linear in- 
creasing monotonic function of the social de- 
sirability scale value of the item [p. 255].“ 
Kenny (1956) determined the mean social 
desirability rating (SD) based on grouped 
data for each of the 25 trait-descriptive ad- 
jectives and found that the SD values cor- 
related as high as .81 with the probability of 
item endorsement established on the basis. 
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of self-rating obtained from a separate 
sample 

The data reported above strongly suggest 
the need to control the SD factor in studies 
designed to test the hypothesis of differential 
patterns of item endorsement by groups classi- 
fied in terms of a particular personality di- 
mension. Groesbeck (1958), for example, 
found that persons who were high in both 
n Achievement and n Affiliation scales, rated 
themselves significantly higher on the Effec- 
tive Intellectual Functioning scale (developed 
by Gough from MMPI items) than groups 
having a different configuration of the above 
two motives. Martire (1956) found that among 
Ss classified according to strength and gen- 
erality of achievement motivation, people with 
low need for achievement (n Ach) endorsed 
to a lesser extent a number of trait names re- 
lated to achievement, sociability, emotional 
maturity, etc., than Ss who obtained high 
n Ach scores whether achievement-related 
cues were present or not. Martire’s (1956) 
data further suggest that “each group tends 
to see as characteristic, to some degree, those 
traits which they consider important [p. 
370].” To what extent the reported differences 
between the criterion groups with respect to 
self-description are free from the effects of 
social or personal desirability of the trait 
names endorsed is a problem that needs 
systematic study. A possible difference due 
to acquiescent response set or “Yes-saying” 
has also not been assessed so far. Any at- 
tempt to determine the relationship between 
endorsement frequency of self-descriptive 
adjectives and scores on a particular per- 
sonality dimension would have to be con- 
trolled both for SD and acquiescence. 

The present investigation is a preliminary 
effort in this direction. The primary purpose 
of the study was to test the hypothesis that 
people with high verbalized need for achieve- 
ment (v Ach) tend to endorse more achieve- 
ment-related trait names than persons with 
low v Ach even after the need to present 
oneself in a socially approved fashion and 
acquiescent response set are held constant. 


METHOD 
Subjects 


The Ss used were 60 undergraduate and 40 
postgraduate students between the ages of 18 and 
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25 yr. The sample consisted of 37 males and 63 
females studying in different colleges and graduate 
departments of Patna University, Patna, India 
during the 1964-65 session. The mean age of the 
sample was 20.39 yr. The Ss were all unpaid volun- 
teers. 


Instruments 


The adjective checklist used in this investigation 
was the Femininity Adjective Check List (BACL) 
developed by Berdie (1959). This checklist contains 
a total of 148 adjectives printed in two columns 
with space next to each item for checking if S 
thought that particular adjective applied to him 
Berdie (1959) has derived a femininity scale on 
the basis of Ss’ endorsement of 61 items included 
in the list of 148 adjectives. The test-retest re- 
liability of the femininity scale, as reported by 
Berdie (1959), is 81 for a sample of 95 men. 
Out of these 148 items of BACL, only 21 adjectives 
were regarded unanimously by 10 graduate students 
in psychology to be directly related to different 
aspects of achievement motivation. Endorsement of 
each of these 21 items by S was credited with 1 
point. 

The v Ach scores were obtained from the 
Sentence Completion Test (SCT) which was de- 
veloped by Mukherjee (1964) to assess specifically 
consciously recognized achievement values among 
American college students. The SCT has also been 
found usable for Indian college students (Mukherjee 
& Verma, 1966) and has recently been used in a 
number of studies conducted in India. Details con- 
cerning its construction, administration, and various 
psychometric properties are reported by Mukherjee 
(1964). Basically, the SCT consists of 50 forced- 
choice triads (one item reflecting achievement-related 
sentiment and the other two pertaining to other 
aspects of manifest needs), selected in such a 
manner as to minimize the SD factor. Examples 
of the forced-choice triads included in SCT appear 
elsewhere (Mukherjee, 1968). The S’s score on SCT 
is the number of times he has chosen a statement 
reflecting any 1 of the following 10 aspects of 
achievement motivation: (a) hope of success, (b) 
fear of failure, (c) high standard of excellence, (d) 
sense of competition, (e) optimism, (f) perseverance, 
(g) concern for creative work, (h) preference for 
difficult and challenging tasks, (i) interest in making 
future plans, and (j) identification with a successful 
authority. The test-retest reliability of the SCT after 
an interval of nearly 2 mo. has been found in various 
studies to range from .71 to 83. The test-retest Cor- 
relation has been found to be almost .75 for a sample 
of 100 Indian male graduates to whom the test 
was readministered after a month and a ha 
(Mukherjee & Sinha, 1967). Previous studies 
(Mukherjee, 1964, 1965a, 1965b, 1965c, 1968) have 
demonstrated the concurrent and construct validities 
of the SCT as a measure of v Ach. A person high 
in v Ach, as measured by the SCT, is one who by 
word habitually indicates a desire to compete suc- 
cessfully with a standard of excellence. There 15 
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TABLE 2 


PRODUCT-MOMENT INTERCORRELATIONS AMONG 
MEASURES OF V ACH, SOCIAL DESIRABILITY, 
ACQUIESCENCE, AND FREQUENCY OF 
ENDORSING ACHTEVEMENT-RELATED 
Trarr NAMES 


Endorsement | . 
of achieve- pe M-C 
Variable ment- 90 8 2 | SD 
related ments | Scale 
adjectives 
Sentence Com- 
pletion Test 56 09 15 
Frequency of 
endorsement of 
achievement 
adjectives 17 20 
Total no. endorse- 
ments of adjectives 67 


Note.—N = 100, 7 = .254 required for significance at the 
-O1 level. For. Os level, the required value is. 105. 


appear in the third column of Table 1. A 
critical ratio test based on Z statistic was 
also run for each item in order to test the 
significance of the differences between two 
independent proportions of item endorsement. 
The results of the test of significance (Z 
statistics) are shown in the last column of 
the same table. 

In Table 2 are given the product-moment 
intercorrelations among the major variables 
of the present study. From the intercorrela- 
tions, a few first-order partial correlations 
were also computed and they are as follows: 
112.3 = 95, 712.4 = .55, fisa = —.014, faga = 
.049, the first two of which are significant at 
the .01 level. In order to determine the 
intrinsic relationship between achievement 
values (v Ach) and frequency of endorsing 
achievement-related traits, a second-order 
partial correlation was computed in which the 
effects of acquiescence as well as social de- 
sirability were eliminated. The obtained 
coefficient 712,34 turned out to be .56 which 
is significant at the .01 level. 


Discussion 


The main purpose of the study was to ex- 
amine whether or not persons classified as 
having higher achievement values (v Ach) 
endorse achievement-related traits more fre- 
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quently than those who were found to have 
a low achievement need. The findings of the 
investigation clearly demonstrate that for 
purposes of self-description the High v Ach 
group chose more frequently the following: 
(a) ambitious, (b) assertive, (c) capable, 
(d) confident, (e) determined, (/) enter- 
prising, (g) forceful, (2) foresighted, (i) 
gracious, (j) imaginative, (k) independent, 
(l) industrious, (m) initiative, (n) painstak- 
ing, (o) persevering, (y) rational, (q) serious, 
(r) sincere, (s) spontaneous, () steady, (u) 
straightforward, and (v) thoughtful. The Ss 
of the Low v Ach group described themselves 
more frequently as “slow” and “submissive” 
than Ss of the High v Ach group. 

In a previous self-rating study, Mukherjee 
(1965c) also found that Ss having high 
achievement values tended to rate themselves 
significantly higher on items pertaining to 
perseverance, industriousness, and diligence. 
No significant differences were found with 
respect to dimensions such as sociability, intel- 
ligence, emotional stability, etc. The findings 
of the present study are therefore much in 
line with the previous results. It is generally 
believed that persons with high need for 
achievement tend to be independent and this 
is supported by the study reported by Cran- 
dall, Katkovsky, and Preston (1960). Winter- 
bottom (1958) has also reported some data 
which suggest that high achievement motiva- 
tion is associated with early maternal en- 
couragement for independence training. Such 
a relationship may also be true for achieve- 
ment values (v Ach). Thus, the higher fre- 
quency of endorsement on the part of the 
High v Ach group of trait names such as 
independence, initiative, and spontaneity as 
found in the present study indicates more 
or less the same results established in the area 
of need for achievement (n Ach). 

It is interesting, however, to note that 
the differences between the two criterion 
groups did not turn out to be statistically 
significant so far as their endorsement of the 
following trait names is concerned: active, 
anxious, bold, conventional, dependent, vigor- 
ous, etc. Mukherjee and Sinha (1967) re- 
ported a significant negative relationship be- 
tween v Ach and manifest anxiety. It is pos- 
sible that the connotations attached to these 
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adjectives by the Ss of the present study may 
be at variance with the conceptual meaning 
of the respective dimensions employed in 
deriving the various tools used in the studies 
reported above. 

The first important observation emerging 
out of the present study is that the persons 
having high scores on the SCT endorsed sig- 
nificantly more of the total 21 achievement- 
related items of the checklist than did those 
who scored low on the SCT. The correlation 
between SCT scores and frequency of en- 
dorsing achievement-related items also turned 
out to be significant even after the effects 
of both acquiescence and SD were partialed 
out, as was hypothesized. 

In the light of the present confusion over 
what it is that personality assessment in- 
struments are measuring, it is relevant to 
question whether the significantly higher en- 
dorsement of achievement-related traits by 
the High v Ach group reflected basic person- 
ality differences or were merely artifacts of 
response styles. Christie and Lindauer’s 
(1963) recent review points out that most 
existing scales and inventories are affected 
by both SD and tendency to agree. The re- 
sults of the present study clearly show that 
acquiescent response set does not seem to 
strongly influence scores either on the BACL 
or on the SCT. In so far as SD is concerned, 
it has already been reported that the SCT 
and the M-C SD scale are essentially un- 
correlated (Mukherjee & Sinha, 1967). The 
correlation between the two variables also 
turned out to be insignificant in the present 
study. It seems then that the attempt to 
minimize the SD factor in the process of con- 
structing the SCT has not been futile. 

Thus, one implication of the findings of the 
present study is that achievement-oriented 
self-perception and acquiescence set are not 
highly related. The study also reveals that 
the correlation between SD and tendency for 
“yes-saying” is high enough (cf. Solomon & 
Klein, 1963) to support the contention of 
Webster (1962) that “these two ‘dimensions’ 
are functionally integrated into a larger 
syndrome within personality [p. 789].” 

The results also indicate that, as a group, 
the achievement-oriented assessments college 
students make of themselves are not very 
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different from the ones made by an objective 
test of personality. This suggests that the 
BACL is an instrument sensitive enough to 
differentiate between those with high achieve- 
ment values and those with low achievement 
values. 

The present study also affords additional 
evidence of the construct validity of the SCT 
as a measure of achievement orientation by 
relating individual differences in value achieve- 
ment to preference for endorsing achieve- 
ment-related adjectives on the Berdie Check 
List. The findings in general support the 
hypothesis proposed: that what has been 
described as typical achievement value is a 
significant variable affecting one’s preference 
with regard to self-descriptive trait names 
having achievement connotations. In conclu- 
sion, the results indicate that the forced- 
choice measure of verbalized need for 
achievement called SCT can not only success- 
fully isolate the presumed differential strength 
of achievement values in groups but can also 
successfully predict the probability of en- 
dorsement of achievement-related trait names 
for a group even when the SD factor operates. 
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2 experiments investigated the relative effects of critical signal rate, normal 
noncritical signal rate, and critical signal probability on detection in a visual 
vigilance task. In all conditions Ss had to monitor a clock display for a period 
of 54 min. The results of both experiments suggested that normal signal rate 
may be of greater importance in a vigilance situation than either critical signal 
rate or critical signal probability. In general, both percentage of detections and 


percentage of stimuli falsely respond 
signal presentation, 


Although numerous experiments have in- 
dicated that probability of detection is di- 
rectly related to critical signal rate, recent 
studies of vigilance behavior have suggested 
that additional rate variables, such as non- 
critical signal and total stimulus rates, may be 
of equal or even greater importance in de- 
termining vigilance performance. Colquhoun 
(1961) observed that many of the studies 
which found a direct relationship between 
performance and frequency of critical signal 
presentation (Jenkins, 1958; Kappauf & 
Powe, 1959) had utilized tasks in which an 
increase in the number of critical signals was 
simultaneously accompanied by a decrease 
in normal or noncritical signals. Accordingly, 
Colquhoun investigated the separate effects of 
both critical and normal signal rates and 
concluded that detection efficiency was de- 
termined by the probability that a stimulus 
will be a critical signal, that is, number of 
critical signals divided by the total number 
of stimuli presented. Subsequent studies 
(Jerison & Pickett, 1964; Jerison, Pickett, & 
Stenson, 1965; Johnston, Howell, & Gold- 
stein, 1966) have confirmed the general find- 
ings of Colquhoun by obtaining differences in 
performance which were directly related to 
critical signal probability. On the other hand, 
the results from a recent study by Jerison 
(1965) have suggested that the main variable 
affecting performance may be the number of 
normal signals (noncritical) and not critical 
signal probability. Jerison’s (1965) findings 
indicated that under certain conditions per- 


1 The author is also at the State University of 
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led to were inversely related to normal 


centage of detections decreased as the num- 
ber of noncritical signals increased, even 
where critical signal probability was equal. 

Since each of the above studies (Colquhoun, 
1961; Jerison et al., 1965) utilized only two 
or three groups and thus only two comparison 
levels of each of the relevant variables, the 
discrepancy between findings might be due to 
the particular levels which were investigated. 
Thus, the present experiments were designed 
to provide further insight into the relative 
effects of critical signal rate, normal signal 
rate, and critical signal probability. This was 
accomplished in Experiment I by investigating 
the effects of critical signal and total stimulus 
rates at various signal probabilities and in 
Experiment II by using a constant critical 
signal probability level derived from several 
combinations of critical and noncritical signal 
rates. 


EXPERIMENT I 
Method 


Subjects. Data are reported on a total of 
undergraduate males who participated as part 
the requirements of the introductory course 
psychology at Syracuse University, Data from nine 
additional Ss were discarded due to equipment 
failure (V= 4), not responding (V=3), or con- 
tinuous random responding (N = 2). 

Apparatus. The visual display was a Type S-1 
Standard Electric clock with a 1-rps sweep rate 
which had been modified as follows: (a) the in- 
strument face was reversed and painted white; (b) 
clock face diameter was reduced from 34 to 2? in. 
by painting the outer rim of the glass display sur- 
face white; and (c) the black sweep hand was 
painted white toward the center of the display, 
leaving only a -in. black portion at the tip. 
‘All three modifications were based on preliminary 
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testing and were designed to increase the difficulty 
of the vigilance task. 

As in other clock-type vigilance tasks, pulses 
were presented at fixed rates which allowed the 
sweep hand to move or jump a specified distance. 
The normal or noncritical signal consisted of a 9° 
movement, while the critical signal to be detected 
by S had a sweep-hand movement of 20°. Thus, 
S's task was to detect a “double jump” which was 
actually 2.2 times the distance of the normal non- 
critical signals, Detection responses were to be made 
on a telegraph key located directly in front of S. 

Two identical clocks were modified and placed 
in adjoining rooms to allow for simultaneous testing 
of two Ss. Each clock was positioned at one end 
of a table with an acoustically tiled wall behind it. 
The S was located at the other end of the table 
approximately 4# ft. from the clock. Further, to 
reduce possible noise and vibration transmission, 
each clock was placed upon acoustic-absorbing 
material. Finally, to insure complete acoustic isola- 
tion and a uniform acoustic environment, Ss wore 
earphones and received 75 db. of white noise through- 
out the testing session. 

All programming and recording equipment were 
located in an adjoining control room. The rate of 
stimulus presentation was controlled by Grason- 
Stadler Programming modules, while critical signal 
rate and onset were controlled by a Gerbrands tape 
Programmer. An Esterline Angus 20 pen Model 
recorder and a Grason-Stadler Print-Out counter 
were used to record each signal and each S’s re- 
sponse. Formal instructions and white noise were 
prerecorded and played back through Ss’ earphones 
by a Wollensak tape recorder. 

Signal schedules. Stimuli were presented at two 
fixed rates of 30 and 60 stimuli per minute (ie, 
normal plus critical signals), providing a total of 
1,620 and 3,240 stimuli, respectively, for the 54-min. 
session. The preprogrammed film tapes provided 
that 27, 54, or 108 of the stimuli presented in each 
test session were critical signals while the rest were 
normal signals. Each Program tape was set up with a 
time base of 2 min., such that within each critical 
signal rate the same number of signals were presented 
in each 2-min. block, that is, one, two, or four 
critical signals per 2 min. Critical signals were pro- 
grammed randomly within the restrictions imposed 
by the 2-min. blocks. Table 1 Presents the six 
combinations devised from the three critical signal 
and the two total stimulus rates and, for each 
condition, further illustrates the probability of a 
stimulus being a critical signal. 

Procedure. The Ss were randomly assigned to 6 
groups with 12 Ss per group. Each group served un- 
der a different combination of total stimulus rate 
and critical signal rate. All Ss served for 54 min., 
thus providing a measure of performance for three 
18-min. time blocks. 

At the beginning of the testing session, Ss were 
given informal instructions which drew attention to 
the clock and key and which provided stimulus 
samples. The Ss continued to receive sample stimuli 
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TABLE 1 


CRITICAL SIGNAL PROBABILITY ror Each or 
THE 6 EXPERIMENTAL CONDITIONS 


| Critical signals 
Total stimuli | — 
| 


27 | 54 108 
1620 0166 .0333 0666 
3240 | .0083 0166 0333 


until they could discriminate between the normal 
and critical signals. At this time Ss were told to 
put on their earphones and were given the formal 
pretaped instructions for the vigilance task. The 
formal taped instructions provided further examples 
of the normal and critical signals and instructed Ss 
that their job was to press the telegraph key as 
soon as a critical signal was detected. The Ss were 
further informed of the white noise and were re- 
quested to remain seated and to continue the task 
until E entered the room to terminate the session. 

All watches were removed before the testing 
session began so Ss did not receive any time cues, 
although they did know that the session would last 
“about an hour.” 


Results 


A single response-time criterion was used 
for all groups even though the duration of the 
interval between successive stimuli was in- 
versely dependent upon rate of stimulus 
presentation. Since Ss tended to pace their 
responses to the off-intervals, the single cri- 
terion was based upon the duration for the 
groups with the slowest stimulus rate. Thus, 
a response to a critical signal was considered 
as being correct only if it occurred within a 
2-sec. interval following signal presentation. 
All responses which occurred outside of this 
interval were considered as false detections. 

Correct detections. The number of critical 
signals detected by each S during each of the 
three 18-min. time periods was converted to 
percentage correct by dividing the number 
of signals detected by the number of signals 
presented. A Hartley’s test of homogeneity of 
variance (Winer, 1962) was run using the 
between-groups effects of critical signal rate 
and total stimulus rate and indicated that the 
assumption of homogeneity was tenable and 
that all groups could be run in a single anal- 
ysis of variance. The analysis of variance of 
the data indicated that the only significant 
effects were total stimulus rate, F (1/66)= 
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14.25, p < .01, and time periods, F (2/132) 
= 36.62, p < .01. No consistent differences in 
performance were attributable to critical 
signal rate or to the interactions between the 
variables. 

Figure ! presents mean percentage of cor- 
rect detections for each of the six groups 
during each of the three 18-min. time periods. 
The main effect of time periods is clearly 
shown by the fact that all groups made less 
detections in each successive period. Similarly, 
the data underlying the significant effect of 
total stimulus rate indicate that performance 
was poorer with the faster rate of stimulus 
presentation as compared to the slower rate. 
The lack of interaction between these variables 
suggests that, in general, the increase in total 
number of stimuli did not affect the rate of 
vigilance decrement but, in effect, lowered 
performance in all time periods. 

Of further relevance to the evaluation of 
the various rate parameters in vigilance per- 
formance are the findings that, although in 
the predicted direction, the effects due to 
critical signal rate were not sufficiently large 
or consistent to achieve statistical significance. 
These findings plus the significant effect of 
total stimulus rate suggest that the rate of 
noncritical or normal signal presentation may 
be the most important of the variables under 
question. That is, the combined results in- 
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Fic, 1. Percentage of detections as a function. of 
total stimuli, and 18-min. time 


critical signals, 
periods. 


Conversion of the false-report data into 


F (1/66)= 9.46, p < 01. Mean percentages 


b; 
normal and critical signals in a manner which 
maintained a single critical signal probability 
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level. Three predictions could be made with 
this design: (1) if probability level is the 
most important factor, then rate of stimulus 
presentation should have no effect upon per- 
formance; (2) if noncritical signal rate is the 
most important variable, then detection per- 
formance should vary inversely with rate of 
stimulus presentation; and (3) if critical 
signal rate is of greatest importance, then 
performance should vary directly with stim- 
ulus presentation rate. 


Method 


The Ss consisted of 20 male and 20 female under- 
graduate students at Syracuse University. Data 
from seven additional students were discarded due 
to equipment failure (NV =6) or refusal to complete 
the session (N = 1). 

Five male and five female Ss were randomly as- 
signed to four stimulus rate groups which were 
derived from different combinations of normal 
signal rate and critical signal rate. These combina- 
tions, which will be written in the form “critical 
signal rate/normal signal rate,” are 216/6264, 108/ 
3132, 54/1566, and 27/783. All combinations have 
the identical signal probability of .033; that is, the 
ratio of total critical signals to total stimuli is 
033 in all four combinations. 

The basic apparatus, task instructions, and 
methods of making the stimulus tapes were identical 
to those used in Experiment I. In all cases, S’s 
task was to depress the telegraph key upon detec- 
tion of a critical signal. Similarly, all Ss served 
for a total of 54 min. 


Results 


As in Experiment I, the criterion for a 
correct detection response was determined by 
the group which had the longest duration be- 
tween the presentation of succeeding stimuli, 
that is, Group 27/783 with 810 total stimuli 
per 54 min. and 1 every 4 sec. Thus, in the 
present experiment the cutoff time was 4 sec., 
and all responses were counted as being cor- 
rect if made within 4 sec. after a critical 
signal was presented. All other responses were 
considered as being false detections. 

Correct detections. The Hartley test in- 
dicated homogeneous variances for the four 
experimental groups. Analysis of variance of 
percentage of detections indicated that stim- 
ulus rate groups, F (3/32)= 6.41, p < .01, 
time periods, F (2/64)= 18.52, p < .01, and 
the Time Periods X Sex interaction, F (2/64) 
= 3.29, p< .05 were the only significant 
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TABLE 2 


Perscerrace or Carticat Stax ats Corerctiy De- 
TECTED AND PERCENTAGE OF Nou Siowats 
nien Recetven Couutsstox RESPONSES 


Group Correct detections) False responses 
216/6264 43,52 1,21 
108/3132 44.64 1.45 

54/1566 | 72.62 | 8.9 

27/783 | 61.86 | 7.31 


effects. As in Experiment I, the interaction 
between time periods and stimulus rate was 
not significant. 

Table 2 illustrates the data underlying 
the findings for the stimulus rate groups and 
indicates that performance was inversely re- 
lated to the rate at which the normal or 
noncritical signals were presented. More 
specifically, the table suggests the two slowest 
critical signal/normal signal combinations led 
to better performance than obtained with the 
two faster rates. Further, the lack of in- 
teraction with time periods suggests that these 
differences were apparent in all three 18-min. 
blocks. Figure 2 presents the data for the 
Time Periods X Sex interaction and suggests 
that although there was a general decrement 
in performance over time, the rate of decre- 
ment was greater for males than females. 

False detections. The Ss exhibited a large 
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Fic. 2. Percentage of detections for male and female 
subjects as a function of 18-min. time periods. 
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amount of variability in the number of false 
detections, with 20% (eight Ss) making 65% 


1,566 group was more than twice that of the 
other groups, this difference is attributable 
to the error total of one S, which, in fact, 


Finally, although number of false reports 
appeared to decrease with time, an analysis 
indicated that this decline was not significant. 

Conversion of number of false reports Into 
percentages by dividing by the total number 
of normal signals which could be responded 
to with the 4-sec. response criterion Indicated 
that stimulus rate groups was a significant 
effect, F (3/36)= 3.75, p< os with this 
measure, That is, since the t of com- 
missions appeared to be similar, the fewer 
the number of signals presented, the higher 
the percentage which was responded to by S$. 
Table 2 indicates that the general trend of 
these data closely resembles that for correct 
detections. 


Discussion or Exprrmmnts I ano M 


In general, the present results suggest that 
both percentage correct detections and per- 
centage of stimuli falsely responded to are 
inversely related to the number of normal 
signals which are presented in a session. 
Further, the direction of the results in both 
experiments indicates that under the 
conditions neither number of critical stimuli 
nor the probability of a stimulus being 
critical signal had any significant effect upon 
detection performance. Taken together, these 
findings cannot be explained by either the 
expectancy or the arousal theories which 
would lead to predictions which are op- 
posite to the present results. That is, = 
tions from arousal theory would lead one to 
expect less arousal and thus poorer perform- 
ance with low rates of normal signal presenta- 
tion. Similarly, the use of an expectancy hy- 
pothesis would predict, in Experiment TI, 
overall detection performance to be directly 
related to critical signal rate and amount 
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made using 0.5 as the probability level 
whereas the highest probability of a stim- 
ulus being a critical signal was .06 in the 
present experiments and .04 in the studies 
by Jerison (1965). In view of the large dif- 
ference in signal probabilities, it appears pos- 
sible that dual processes may be present such 
that the normal signal rate may be the deter- 
miner of performance at low probabilities, 
while at high probabilities either (a) the 
probability itself determines performance, or 
(5) detection performance has reached an 
asymptote and is not affected by further 
changes in rate or probability. These factors 
require further research before any general 
conclusions may be drawn. 

Finally, the effect of males versus females, 
although of lesser importance than the other 
results, suggests that females are better able 
to maintain performance over a long period 
of time. This interaction is in general agree- 
ment with the results of a study by Whitten- 
burg, Ross, and Andrews ( 1956), although the 
exact shape of the interactions is different due 
to different time measurements. That is, the 
present study suggested that males were 
superior in the first 18-min. period with a 
reversal occurring in the second and third 
period, whereas Whittenburg et al. using 
half-hour time periods found no differences 
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during the first hour with the females being 
superior to male Ss during the last hour, 
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PARTICULARISM AND ORGANIZATIONAL POSITION: 


A CROSS-CULTURAL ANALYSIS + 


LOUIS A. ZURCHER? 
The Menninger Foundation, Topeka, Kansas 


The Stouffer-Toby Role Conflict Scale, a measure of particularism (the value for 
institutionalized obligations of friendship), was administered to 230 employees 
in 13 bank branches in Mexico and the United States. Findings suppert the 
hypothesis that for both bank officers and line employees Mexicans are sig- 
nificantly more particularistic than Mexican-Americans, who in turn are sig- 
nificantly more particularistic than Anglo-Americans. The hypothesis that, by 
ethnic group, officers are less particularistic than line employees was partially 
supported. The cultural development of particularism and the potential impact 
of that value upon organizational behavior are discussed. Some suggestions are 
offered to management and to researchers concerning the importance of under- 
standing the interactions among specific value orientations and specific 
organizational behaviors, and concerning the implications of such interactions 
for employee satisfaction and organizational integrity. 


Rapid industrialization of “underdeveloped” 
countries, continued expansion of international 
business operations, and increased use of 
indigenous labor by foreign companies 
heighten the relevance of cross-cultural stud- 
ies of individual behavior in complex organ- 
izations. Though such studies are relatively 
few, their findings invariably indicate the 
pervasive influence of cultural values upon, 
for example, the pace and pattern of company 
development, the emergence of management 
and supervisory styles, and the varieties and 
intensities of motivation for, attitudes toward, 
and alienation from work (Fayerweather, 
1959; Haire, Ghiselli, & Porter, 1963; Me- 
Clelland, 1961; Whitehill, 1964; Whyte, 1963; 
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Whyte & Williams, 1963; Zurcher, Meadow, 
& Zurcher, 1965). 

Ideally, the investigator seeking to ap- 
preciate the dynamics of cross-cultural dif- 
ferences in organizational behavior would him- 
self experience the life styles and work settings 
of his subjects (Ss), and then come to grips 
with the complex interaction between whole 
personalities and whole organizations. Con- 
siderably less ambitious, the purpose of this 
paper is to isolate a single cultural value, 
and to observe how it varies between and 
within specific work organizations. 

The term “value” is taken to mean “a 
selective orientation toward experience, im- 
plying deep commitment or repudiation, which 
influences the ordering of choices between 
possible alternatives in action [Kluckhohn, 
1961, p. 18].” Values are learned in one’s 
parent culture, and “culture” is “the dis- 
tinctive way of life of a group of people, 
their complete design for living [Kluckhohn, 
1951, p. 86].” The value to be studied in 
this paper is “particularism,” or the value 
for institutionalized obligations of friendship. 
Particularism is one component of Parsons 
and Shils’ (1959) general theory of action 
and is juxtaposed with “universalism,” or 
the value for institutionalized obligations to 
society (pp. 76-77). According to Parsons 
and Shils, before acting in a specific social 
situation the individual will be influenced by 
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a particularistic or a universalistic orienta- 
tion, and that orientation will have been 
culturally determined (p. 79). Using varying 
vocabularies, several other social scientists 
have, based upon their comparative observa- 
tions of societies in differing stages of social 
complexity, conceptualized dichotomies or 
continua bounded on one side by a predomi- 
nant value for concrete interpersonal ties and 
on the other by a predominant value for 
abstract societal expectations (Becker & 
Meyers, 1942; Lewis, 1951; Loomis & Beegle, 
1950; Miner, 1952; Redfield, 1941). Mem- 
bers of the less complex, more “primitive” 
societies are reported to tend toward par- 
ticularism, members of complex societies 
toward universalism. 

In that context, Mexico has been described 
as manifesting (modally) a cultural value for 
particularism. Parsons and Shils (1959), 
Lewis (1951), Redfield (1947), and Cohen 
(1961) report the influence of the Mexican 
social structure, particularly the extended 
kinship system, upon the individual’s assess- 
ment of events in his daily life. Government, 
laws, and other abstract societal expectations 
are observed to be distant, ephemeral, and 
untrustworthy to the Mexican, and not suffi- 
cient basis for the evaluation of other human 
beings. What seems more important is the 
depth and significance of interpersonal ties 
with blood relatives, compadres, and friends. 
Another person is accepted by the individual 
not in terms of office held, title displayed, or 
any other such abstraction, but in terms of 
immediate interpersonal experience with the 
evaluating individual. The essential questions 
are: What kind of person is he to me? Can 
he be trusted? 

On the other hand, the United States has 
been described (modally) as a universalis- 
tically oriented society. The high level of 
technology, “scientism,” and bureaucracy, the 
keen emphasis upon competition, achievement, 
and mobility encourage more shallow inter- 
actions with other human beings and a greater 
dependence upon collective societal definitions 
of others (Buber, 1958; Fromm, 1940; 
Riesman, 1950). The essential questions are: 
What (group, company, department, etc.) 
does he belong to? How high up is he? 
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If these observations on the two societies 
are accurate, differences in selected cultural 
values, specifically particularism, should be 
empirically demonstrable. That the present 
discussion of Mexico and the United States 
has been in terms of modal value orientations 
admits to the fact that, for example, groups 
in rural America may be more particularistic 
than groups in highly urbanized Mexico City. 
Thus, as with all cross-cultural research, 
special effort must be made to establish 
comparability of samples. In this study, com- 
parability was approached by drawing Ss 
from similar work organizations in Mexico 
and the United States. The structures of the 
work organizations were such that employees 
could be classified as “officer” or “line.” It 
was therefore hypothesized that otherwise 
comparable Mexican officers would be more 
particularistic than Mexican-American officers, 
who would in turn be more particularistic 
than Anglo-American officers. A similar di- 
rectional difference in particularism was hy- 
pothesized for line employees. 

Since the work organization was bureau- 
cratic in nature, the assumption would be 
that movement up the organizational ladder 
would be associated, by a selection or a 
socialization process, with a shift away from 
the particularistic value orientation. Some in- 
dications of that relationship can be found in 
the work of Merton (1940) and Whyte 
(1948). In this study it was therefore hy- 
pothesized that Mexican officers would be 
less particularistic than Mexican line em- 
ployees, Mexican-American officers less par- 
ticularistic than Mexican-American line em- 
ployees, and Anglo-American officers less par- 
ticularistic than Anglo-American line em- 
ployees. 


METHOD 
Subjects 


The work organization chosen as the setting for 
this study was the bank, because it provided an 
accessible and cross-culturally comparable bureau- 
cratic organization from which to draw Ss. Mexican 
and American banks are almost identical in formal 
organization and operating procedures (Beteta, 
1963), the former being based upon the structure 
of the latter. 

Three experimental groups were used, the num- 
ber of Ss for all three groups totaling 230. The 
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native-born, Spanish 
speaking (unilingual) Mexican citizens employed 
by two bank branches in Nogales, Sonera. The 
second group was composed of bilingual 
English) Mexican-Americans employed by two bank 
branches in Nogales, Arizona (bilingwaliem and 
United States citizenship are conditions for em- 
ployment in these branches). The third group was 
composed of English-speaking (unilingual) 
Americans employed by nine bank in 
Tucson, Arizona. All 11 United States banks are 
branches of the same parent organization. Both of 


at least one vice-president for each cultural group) 
who were at work on that day were administered 
the questionnaire. The final sample included all of 
these employees, with the exception of 1 Anglo- 
American who was removed from the Merican- 
American group, and 14 Mexican-Americans who 
were removed from the Anglo-American group. The 
employees of the Nogales, Sonora, banks were all 
Mexicans. All three groups approximated each other 
for proportion of officers (about one to every six 
line employees), for employee longevity (Afda = 2-4 
yr.), and for age (Mdn = 20-29 yr.). The ratio of 
males to females was found to be disproportionate 
among the groups (22:16 Mexican; 14:29 Mexican- 
American; 66:83 Anglo-American), but subsequent 
analysis demonstrated sex not significantly to be 
related to particularism. Officers (N=35) were 
vice-presidents, managers, assistant managers, and 
all other employees who had “officer” in their 
formal titles (i e., loan officer, personnel officer, etc.) 
Line employees (N=195) were tellers, clerks, 
typists, secretaries, including first-line supervisors 
(ie, head teller, chief clerk, etc.) 


Measures 


As part of a questionnaire packet that contained 
biographical and employment history questions and 
some attitude toward work scales, each S was 
given the Stouffer-Toby Role Conflict Scale. The 
Role Conflict Scale is a four-item Guttman instru- 
ment designed to sound the degree of particularism 
by presenting four situations involving conflicts be- 
tween obligation to a friend and more general social 
obligations, and forcing the individual to choose 
one set of obligations or the other (Stouffer & 
Toby, 1951). For example, Item 1 states: 


You are riding in a car driven by a close friend, 
and he hits a pedestrian. You know he was going 
at least 35 miles an hour in a 20-mile-an-hour 
speed zone. There are no other witnesses. His 
lawyer says that if you testify under oath that 
the speed was only 20 miles an hour, it may 
save him from serious consequences. 

What right has your friend to expect you to 
protect him ? 
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ism was significantly related to ethnic mem- 
bership (H = 35.08, dj = 2, p < .0005). Fur- 
ther analysis by Fisher’s Exact Probability 
test (Ss who scored O versus those who did 
not) and inspection of Table 1 indicate the 
direction of the overall results: Mexican 
versus Mexican-American officers (p = .045); 
Mexican versus Anglo-American officers (5 = 
.0004); Mexican-American versus Anglo- 
American officers (p = .008). Thus, as hy- 
pothesized, the particularism of the bank 
officers is influenced by cultural background— 
Mexicans significantly more particularistic 
than Mexican-Americans, who in turn are 
significantly more particularistic than Anglo- 
Americans. 

Table 2 presents the distribution of line- 
employee particularism scale scores for each 
ethnic group. As with the officers, a Kruskal- 
Wallace one-way analysis of variance in- 
dicated that line-employee particularism was 
significantly related to ethnic membership 
(H = 9.92, df = 2, p < .0005). Further anal- 
ysis by the Median Test (one-tailed) and 
inspection of Table 2 indicate the direction of 
the overall results: Mexican versus Mexican- 
American line employees (x? = 14.44, df = 1, 
5 < .0005); Mexican versus Anglo-American 
line employees (x? = 24.80, df=1, p< 
0005); Mexican-American versus Anglo-Amer- 
ican line employees (x? = 3.9, df=1, p< 
.025). Again as hypothesized, the particular- 
ism of the bank line employees is influenced 
by cultural background Mexicans signifi- 
cantly more particularistic than Mexican- 
Americans, who in turn are significantly more 
particularistic than Anglo-Americans. 

With ethnic groups combined, officers were 
demonstrated to be significantly less partic- 


TABLE 1 


OFFICER Particutarism BY CULTURAL ORIGIN 


Scale score 


Cultural origin N 

0 1 2 3 4 
Mexican 2 2 1 0 5 
Mexican-American 4/0 0 1 1 6 
Anglo-American 2113 0 0 0 | 24 
Total 25°) 5 2 2 135 
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TABLE 2 


LINE-EMPLOYEE ParTICULARISM BY CULTURAL ORIGIN 


Scale score 


Cultural origin 


Mexican 5 5 6 7 | 10 33 
Mexican-American 17112 4 4 0 | 37 
Anglo-American 82 23 | 10 8 2112 
Total 104] 40 20 | 19 | 12 | 195 
ularistic than line employees (H = 3.56, 


4/ 5, p< .05). However, the hypothesis 
that officers would be significantly less par- 
ticularistic than line employees for each ethnic 
group was supported only for the Anglo- 
Americans (H = 10.31, df= 1, p < .0005). 
Inspection of Tables 1 and 2 suggests that the 
distribution of Mexican and Mexican-Amer- 
ican scores tends toward the hypothesized 
difference between officers and line employees, 
but the small Ws, particularly for the officers, 
make interpretation tenuous and obscure 
statistical analysis. 


Discussion 


The hypothesis that a single value, par- 
ticularism, varies cross-culturally for officers 
and line employees in a specific type of work 
organization has been supported. This find- 
ing encourages the conclusion that value 
orientations are central factors to be con- 
sidered in the social psychology of work, 
but raises further questions about the relation 
of such values to measurable organizational 
behavior. Zurcher et al. (1965) report find- 
ings with the present S group that indicate 
a significant positive relation between par- 
ticularism and alienation from work in the 
bank, which is interpreted to be a universal- 
istic organization. The same authors point 
to an active social club in the Mexican- 
American S banks and suggest that those 
informal ties may be the result of particular- 
istic needs. McClelland (1961) reports Mex- 
ican businessmen to manifest a significantly 
higher need for affiliation (a particularistic 
parallel) and a significantly lower need for 
achievement than do Anglo-American busi- 
nessmen. Fayerweather (1959) observes that 
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PARTICULARISM AND ORGANIZATIONAL POSITION 


Mexican businessmen seem to be more con- 
cerned with the maintenance of personal al- 
liances than with bureaucratic efficiency and 
if forced to choose between those alternatives 
will choose the former. Whyte and Williams 
(1963, p. 6) conclude that the Peruvian 
worker “looks upon the supervisor as a total 
human being, without thinking about the 
various aspects of his job performance,” 
and that this “fits in with the emphasis upon 
the person and personalism” that has been 
noted in Latin American culture. 

Further research, certainly interdisciplinary 
in nature, is needed to delimit more specifi- 
cally the degree to which particularism or 
other values vary cross-culturally and the 
impact they have upon organizational be- 
havior—that is, performance, supervision, 
policy formulation, absenteeism, turnover, 
etc. Especially useful would be data illustrat- 
ing the generation of conflicts in personnel 
expectations from disparities in value orienta- 
tions, when and if such conflicts are functional 
or disfunctional for the individual and for the 
organization, and the means, if appropriate 
and desirable, for the amelioration of those 
conflicts. 

The present study was not able clearly to 
demonstrate for each of the three ethnic 
groups that officers were significantly less 
particularistic than line employees. Insufficient 
officer W may account for the lack of statis- 
tically significant differences between officer 
and line Mexicans and Mexican-Americans— 
a difference which was markedly demonstrated 
for the considerably larger Anglo-American 
group. On the other hand, it may be that the 
value for particularism is not importantly 
altered by or a requisite for managerial status 
in the Mexican and Mexican-American banks. 
The data are not conclusive for either inter- 
pretation. The significant difference estab- 
lished in the Anglo banks supports the original 
hypothesis, but introduces an interesting 
question. What proportion of the difference in 
particularism can be accounted for by the 
screening and selecting process of promotion, 
and what proportion can be accounted for by 
the process of officer role-enactment following 


promotion? 
The data and interpretations presented in 


143 


this paper suggest to management in com- 
plex organizations, particularly those organ- 
izations with international branches or sub- 
sidiaries, that differences in the cultural back- 
ground of personnel may yield differences in 
their value orientations—value orientations 
which may influence organizational behavior 
Also, management should not assume that a 
set of policies, organizational expectations, or 
leadership styles established and proven suc- 
cessful in one culture will inevitably be suc- 
cessful in another. 

The findings of this study suggest to the 
researcher that there is continued need for 
interdisciplinary studies of the interactions 
among specific cultural values and specific 
organizational behaviors. 


REFERENCES 


Becxer, H., & Mevers, R. C. Sacred and secular 
aspects of human association. Sociometry, 1942, 
5, 207-229. 

Betera, M. Banking laws of Mexico, Mexico This 
Month, August 1963, 7-10. 

Burer, M. I and thou. New York: Scribner's, 1958. 

Conex, V. A. Patterns of friendship. In Y. A. Cohen 
(Ed.), Social structure and personality. New 
York: Holt, Rinehart & Winston, 1961. Pp. 353- 
383. 

FAYERWEATHER, J. The executive overseas. New 
York: Syracuse University Press, 1959. 

Fromm, E. Escape from freedom. New York: Farra 
& Rinehart, 1940. 

Hare, M. Grrsettr, E. E, & Porter, L. W. 
Cultural patterns in the role of the manager. 
Industrial Relations, 1963, 2, 95-117. 

RTV CSR HORN, C. The study of culture. In D. Lerner 
& H. D. Lasswell (Eds.), The policy sciences. 
Stanford: Stanford University Press, 1951. Pp. 
86-101. 

Krucxnonyx, C. The study of values. In D. N. 
Barrett (Ed.), Values in America. Notre Dame: 
University of Notre Dame Press, 1961. Pp. 17-45. 

Lewis, O. Life in a Mexican village: Tepostlan 
restudied. Urbana: University of Illinois Press, 
1951. 

Loomis, C. P., & Brecrr, J. A. Rural social sys- 
tems. New York: Prentice-Hall, 1950. 

McCretranp, D. The achieving society. Princeton: 
Van Nostrand, 1961. 

Merton, R. Bureaucratic structure and personality. 
Social Forces, 1940, 18, 560-568. 

Miner, H. The folk-urban continuum. American 
Sociological Review, 1952, 17, 529-537. 

Parsons, T., & Suits, E. A. (Eds.) Toward a general 
theory of action. Cambridge: Harvard University 
Press, 1959. 


144 Lours A. ZURCHER 


RepFietp, R. The folk culture of Yucatan. Chicago: 
University of Chicago Press, 1941. 

REDFIELD, R. The folk society. American Journal of 
Sociology, 1947, 52, 293-308. 

RESMAN, D. The lonely crowd. New Haven: Yale 
University Press, 1950. 

Stourrer, S. A, & Topsy, J. Role conflict and 
personality. American Journal of Sociology, 1951, 
56, 395-406. 

WHITEHILL, A. M., Jr. Cultural values and em- 
ployee attitudes: United States and Japan. Journal 
of Applied Psychology, 1964, 48, 69-72. 

WRVITE, W. F. Culture, industrial relations, and 
economic development: The case of Peru. In- 


ZURCHER, L. A., Meapow, A., & Zurcuer, S. L. 


dustrial and Labor Relations Review, 1963, 16, 
583-594, 


Wave, W. F, & Wruams, L. K. Supervisory ] 


leadership: An international comparison, Sym- 
posium B3, Paper No. B3C, New York State School 
of Industrial and Labor Relations, Cornell Uni- 
versity, 1963, 1-8. 


WIVITE, W. H. The organization man. New York: 


Harper, 1948. 


Value orientation, role conflict, and alienation from 
work: A cross-cultural study. American Sociolog- 
ical Review, 1965, 30, 539-548. 


(Received April 3, 1967) 


Journal of Applied Psychology 
1968, Vol. 52, No. 2, 145-147 


ACHIEVEMENT VALUES AND SCIENTIFIC PRODUCTIVITY 


BISHWA NATH MUKHERJEE * 
Nagpur University, Nagpur, India 


50 college science teachers were tested on Mukherjee Sentence Completion 
Test (SCT), a forced-choice scale for measuring achievement values (v Ach). 
The hypothesis of a positive relationship between v Ach and scientific produc- 
tivity was tested. Results indicated a significant positive correlation between 
the 2 variables. Most of the teachers having advanced research degrees were 


also found to have high v Ach scores 


. The study provides additional evidence 


of the construct validity of the SCT as a measure of v Ach and demonstrates 
the importance of v Ach in the prediction of scientific productivity. 


The purpose of the present study is to 
investigate the relationship between achieve- 
ment values (v Ach) of university teachers 
in the science faculty and their productivity 
in written scientific work defined in terms of 
the number of publications in scientific 
journals, research reports, technical memos, 
and papers contributed to scientific meetings 
and conferences (Taylor, Smith, & Ghiselin, 
1963). The research is based on the premise 
that achievement values can best be viewed 
as those principles that guide one’s conduct 
in making substantive contribution to the 
field or profession in which one is engaged. 
It was therefore predicted that achievement 
values of science teachers and the number of 
their scientific publications are positively cor- 
related. In addition to verifying the above 
hypothesis, the present study aimed at ex- 
ploring the relationship, if any, between 
achievement values and academic degrees 
earned. 

METHOD 
Subjects 

Fifty male teachers of various science depart- 
ments of Patna University, Patna, India volunteered 
as Ss for the study. Their ages ranged from 22 to 
55 yr. with a mean of 31.86 yr. Out of the 50 
teachers who were sampled for the study, 28 
teachers were found to participate in both under- 
graduate and graduate programs. Of the remain- 
ing, 17 were primarily engaged in teaching at the 
undergraduate level and 5 were exclusively connected 
with graduate programs. 


1The author wishes to convey his heartfelt 
thanks to Braj Kumar Mishra, lecturer in psy- 
chology, Sindri College, Sindri, India for collecting 
the data reported in this article. 


Measurement of Achievement Values 


Achievement values were measured by the Sentence 
Completion Test (SCT), a forced-choice scale, de- 
veloped by Mukherjee (1964). The SCT consists 
of SO forced-choice triads (one item reflecting 
achievement-related sentiment and the other two 
pertaining to different aspects of manifest needs) 
selected in such a manner as to minimize the “social 
desirability” factor. A few examples of the types 
of forced-choice triads constituting the SCT are 
given below: 


F 
A. to be faithful to my friends 
and colleagues. 
B. to be very systematic in my 
work. 
C. to do my best in whatever 
work I undertake. 
2. I avoid 
A. gay and irresponsible pleas- 
ure · seekers. 
B. mental confusion and chaos. 
C. those situations which are 
not competitive. 
3. I have a general tendency ....... 
A. to continue a work till it 
is finished. 
B. to analyze others’ judgments 
critically. 
C. to be polished in my manners. 


The S’s score on SCT is the number of times he 
has chosen a statement reflecting achievement- 
related values. Details concerning the construction, 
administration, scoring, and various psychometric 
properties of the SCT have been reported by 
Mukherjee (1964, 1965c). The test-retest reliability 
of SCT after an interval of nearly 2 mo. has 
been found in various studies to range from .71 
to .83, The test-retest correlation has been found 
to be almost .75 for a sample of 100 Indian male 
graduates to whom the test was readministered after 
a month and a half (Mukherjee & Sinha, 1967). The 
Kuder-Richardson reliability estimate for the SCT 
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was .716 for a sample of 248 American colleze 
students enrolled at the Jeffersonville and Blooming- 
ton Campus of Indiana University. Previous studies 
(Mukherjee, 1964, 1965a, 1965c, 1965d, 1966) have 
demonstrated the concurrent and construct validities 
of SCT as a measure of v Ach. A factor analysis 
of SCT (Mukherjee, 1965b) revealed that the test 
covers some of the important dimensions of cog- 
nitive expectations regarding achievement, such as, 
aspiration, ego-ideal, perseverance, identification with 
a successful authority, preference for difficult tasks, 
etc, 


Procedure 


Mimeographed copies of SCT along with separate 
answer sheets were distributed randomly to 50 
science teachers regardless of their age and fields 
of specialization. They were requested to complete 
the SCT during their leisure time and return the 
answer sheet within a couple of days. All answer 
sheets were turned in within 2 days. On the answer 
sheet, each S provided information regarding his 
(a) present academic rank, (b) length of service 
as a college teacher, (c) academic degrees, (d) level 
of teaching, (e) age, and (f) publications. Papers 
published in scientific journals only defined each 
teacher's level of scientific productivity. Papers con- 
tributed to various scientific meetings were also 
considered, irrespective of whether they were 
published or not. 


RESULTS 


Based on the total scores on SCT, Ss were 
classified into two criterion groups, High 
v Ach and Low v Ach. The 25 Ss scoring 22 
and above were arbitrarily designated as the 
High group and the remaining 25 Ss scoring 
21 and below formed the Low v Ach group. 
Table 1 presents the means for the two 
groups on some of the selected variables. 

The data on number of scientific publica- 
tions were examined by f test (one-tailed) 
for differences between means and by cor- 
relational analysis. However, the Student ¢ 


TABLE 1 


MEANS FOR THE Two CRITERION Groups 
on A Few SELECTED VARIABLES 


High v Ach | Low v Ach 
Variable 


group group 

(N=25) | (N=25) 
SCT score 23.96 16.80 
No. publications 8.20 3.80 
Teaching experience in years 8.70 9.49 
Age in years 31.00 32.72 
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test could not be applied directly since the 
High v Ach group had a significantly higher 
variance in publication variable. So in order 
to achieve homoscedasticity, the number of 
papers (P.) contributed by each S was sub- 
jected to a transformation of the form: log 
(Y; +1). The means of these transformed 
scores for the High and Low v Ach groups 
were .7001 and .3665, respectively. The 
unbiased variances turned out to be .2541 
and .2478, respectively. A ¢ test indicated 
that the difference between the two criterion 
groups with respect to transformed scores 
on the publication variable was statistically 
significant (f = 2.36, p < .02). Computation 
of the product-moment correlation between 
SCT scores and number of publications also 
indicated a significant positive relationship 
(r= 32, p < 02). 

In order to test the hypothesis of no as- 
sociation between v Ach as measured by SCT 
and degrees earned, a chi-square test was 
tun. The x? turned out to be 4.024 which 
for 1 df is significant beyond .05 level. When 
the data on level of teaching were subjected 
to a chi-square analysis, it was found that 
most of the teachers belonging to the High 
v Ach group participated in the graduate 
program whereas most Ss having low v Ach 
scores were exclusively engaged in under- 
graduate teaching (x1? = 7.20, p < .01). The 
hypothesis of no association between v Ach 
and academic rank could not be rejected 
(x1? = 2.10, p> .10). Similarly, age and 
v Ach scores were found to have no relation- 
ship (x1? = 32). A chi-square test also re- 
vealed that the two criterion groups did not 
differ significantly with respect to length of 
teaching experience (xı? = .72). 


Discussion 


The study demonstrates that college teach- 
ers in science having high achievement values 
show greater scientific productivity than their 
colleagues whose v Ach scores are low. Con- 
versely, the more productive Ss of the present 
study have attitudes which suggest that they 
strive for a long record of professional 
achievement, for more distant goals, and a 
stronger desire to compete successfully with 
a standard of excellence than the less pro- 
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ductive Ss even though the opportunities 
for scientific research are comparable for 
both the groups. Since the above relationship 
was predicted in advance, the study provides 
additional evidence of the construct validity 
of the SCT as a measure of achievement 
values. 

Direct validational evidence is also fur- 
nished by some of the other data reported 
here. That the relationship between SCT 
scores and academic degrees earned should 
be positive is quite consistent with the con- 
cept of achievement values according to 
which a person having high achievement 
orientation is one who shows self-actualizing 
tendency (Maslow, 1954), exercises his 
talents fully and enthusiastically to achieve 
higher goals that are related to his further 
self-development. It is no wonder that teach- 
ers having higher scores on the SCT were 
found more frequently to teach at the grad- 
uate level than those who have low scores on 
the SCT. The finding holds true even when 
factors such as age, teaching experience, and 
academic rank are held constant. The result 
is interpreted to mean that Ss of the High 
v Ach group have a higher level of aspira- 
tion and a keen interest in undertaking 
difficult tasks. The study lends conclusive 
evidence for the hypothesis that the produc- 
tive scientist has a higher achievement value, 
and thus individual differences with regard 
to scientific productivity may be accounted 
for not only in terms of ability and op- 
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portunity but also in terms of their achieve- 
ment values, 
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DIFFERENTIAL EFFECT OF NOISE ON TASKS OF 
VARYING COMPLEXITY * 


DAVID H. BOGGS* axo J. RICHARD SIMON 
University of lowa 


48 Ss performed on 1 of 2 complexity levels of a 4-choice RT task 
at the same time, performed a secondary auditory monitoring task. All 


Ss performed in both quiet and in noise, Performance indexes were RT and 
secondary-task errors. Noise produced a significantly greater increase in sec- 


errors when the secondary task was paired with the complex 


task 
primary task than when it was paired with the simple primary task. Sec- 
ondary-taik performance provided a more sensitive measure of both task 
complexity and the effect of noise than did the RT measure. 


The purpose of this experiment was to test 
the hypothesis that the deleterious effect of 
noise on performance increases as a function 
of increasing task complexity. There have 
been numerous studies conducted to investi- 
gate the effect of environmental noise on per- 
formance. Some have indicated a decremental 
effect of noise (e.g, Broadbent, 1953, 1954, 
1958a; Jerison, 1959; Jerison & Wing, 1957; 
Sanders, 1961; Woodhead, 1964), while oth- 
ers have indicated an incremental effect (e. g., 
Kirk & Hecht, 1963; Wilbanks, Webb, & 
Tolhurst, 1956). A substantial number of 
studies, however, have revealed no significant 
effect whatsoever (e.g., Broadbent, 1957; Hy- 
man, 1950; Jerison, 1957; Kryter, 1950; 
Kurz, 1964; Wilbanks, Webb, & Tolhurst, 
1956). If research which has remained un- 
published as a consequence of achieving nega- 
tive results were to be made available (e. g., 
Chisman, 1963), the weight of evidence indi- 
cating that noise does not significantly affect 
performance would be considerably increased. 

One explanation for the prevalence of nega- 
tive results in laboratory studies involving 
noise might be that Ss were able to draw upon 
unused perceptual capacity and thereby main- 
tain their performance despite the distracting 
noise. In order to detect changes in unused 
capacity associated with performance in 

1 This paper is based upon a thesis completed by 
the first author under the direction of the second 
author and submitted to the University of Iowa in 
partial fulfillment of the requirements of the mas- 


ter’s degree. 
2 Now at the University of Connecticut. 


noise, the present study utilized a procedure 
called the “method of simultaneous tasks.” 
The procedure involved having S perform, 
simultaneously with the primary experimental 
task, a secondary task which effectively oc- 
cupied that portion of S’s perceptual capacity 
which was not occupied by the requirements 
of the primary task. Thus, performance on 
the secondary task provided a measure of un- 
used capacity, and any effect of noise suffi- 
cient to require the use of this excess capac- 
ity to maintain primary-task performance 
was evidenced as a decrement in secondary- 
task performance. According to Poulton 
(1965), this method was first employed by 
Bornemann (1942a, 1942b) although it was 
apparently also developed independently by 
Bahrick, Noble, and Fitts (1954). 

Of special relevance to the present research 
are two studies which utilized the method of 
simultaneous tasks. Poulton (1958) used 
secondary-task performance as a measure of 
the amount of attention required by a per- 
ceptual-motor task and was able to demon- 
strate a difference in difficulty between two 
versions of the task which was not apparent 
by comparing performance on the two tasks 
when performed alone. Broadbent (1964) 
employed the method to investigate the rela- 
tionship between attention required to a re- 
action-time (RT) task and degree of stimu- 
lus-response (S-R) compatibility. Using a 
secondary, mental-arithmetic task, he com- 
pared the performance of Ss working on com- 
patible and incompatible versions of an RT 
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task 

perceptual-motor task 
difficult by making the S-R code less compati- 
ble, and that this difference in difficulty may 
be measured indirectly by observing 
ance on a simultaneous, secondary task. 

There are probably many reasons for the 
apparent conflicting results concerning the 
effects of noise on performance. One impor- 
tant factor seems to be that some tasks are 
more vulnerable to noise than others. Accord- 
ing to Broadbent (1958b), noise produces 
brief failures of perception which, in turn, 
result in the marked decrements observed on 
vigilance tasks and tasks which require con- 
tinuous handling of information. Jerison 
(1957) suggests that the primary factor de- 
termining the effect of noise on performance 
may be the “flexibility of attention” required 
by the task. Noise seems to have a decre- 
mental effect on vigilance tasks which involve 
several sources of information whereas tasks 
which require attention to only one informa- 
tion source are not 


effect on performance. Broadbent (1954) 
found a noise - induced decrement on a “diffi- 
cult” watch-keeping task (Twenty Dials Test) 
but not on an “easy” task (Twenty Lights 
Test). Park and Payne (1963), however, 
found no evidence of an in between 
noise conditions and the level of difficulty of 
division problems. 

The present experiment was concerned with 
the variable of difficulty or, more properly, 
task complexity. In this study, Ss performed 
two tasks simultaneously. The overall 
plexity or perceptual load imposed by these 


tasks 
ing the S-R compatibility of the primary task, 
a four-choice RT task. Performance on a sec- 
ondary, auditory-monitoring 
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apparatus, on an imaginary line bisecting, and per- 


pendicular to, a line passing through the four switch 


knobs 


Secondary Task 


The secondary task was similar to one used by 
Brown (1965). A continuous Sequence of digits was 
Presented to S through a headphone placed over his 
left ear. He was instructed to listen for odd-even- 
odd sequences of digits, and to respond immediately 
by saying “yes” whenever such a sequence was de- 
tected. The digits were recorded on one channel of 
a stereophonic magnetic tape at regular intervals of 
1.5 sec. One odd-even-odd sequence was located 
randomly within every 15-sec. period of the 10-min. 
tape, so that the tape contained a total of 40 odd- 
even-odd sequences. The second channel contained 
signals, audible only to E, indicating the location of 
each odd-even-odd sequence, The tape was played 
on a Sony Model 464 stereophonic recorder. The 5 
listened to the digits channel through a Grason- 
Stadler Model TDH-39 earphone (equipped with 
MX 41-AR rubber cushion) connected directly to 
the output of the appropriate internal playback 
amplifier of the recorder, while E listened to the 
marker signals through headphones connected 
through a Sony external power amplifier to the 
playback-preamplifier output of the second channel. 
The playback-preamplifier gain of the channel con- 
taining the signals was reduced to a minimum to 
prevent leakage into the digits channel, This ar- 
rangement permitted constant synchronization of 
the digits with the signals to E, and provided com- 
plete consistency throughout the experiment in the 
scoring of errors on the secondary task. 


Noise 


The noise consisted of recorded 0.5-sec. bursts of 
the sound produced by a bandsaw cutting aluminum. 
Acoustical frequency analysis revealed strong peaks 
at approximately 1800 and 1900 cps. A total of 90 
bursts (92 db. SPL) were Presented in the course of 
each 10-min. experimental run. These bursts were 
recorded on the tape containing the digits. The 
bursts were located randomly, but with the restric- 
tion that they never overlap any of the secondary- 
task digits. The masking effect of noise, therefore, 
was not a factor in this experiment. The digits on 
the “noise” tape were copied directly from the 
“quiet” tape, using a second recorder, so that the 
only difference between the two tapes was, in fact, 
the presence of noise on the “noise” tape. 


Subjects 


The Ss were 24 males and 24 females from the 
elementary psychology courses at the University of 
Iowa. All Ss reported having normal hearing, 


Experimental Design 


A mixed design involving one within- and three 
between-S dimensions was employed (Lindquist, 
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1953). Twelve men and 12 women performed on the 
simple RT task, and 12 men and 12 women per- 
formed on the complex RT task. Within each com- 
plexity condition, 12 Ss (6 men and 6 women) per- 
formed first in quiet and then in noise, while the 
remaining 12 Ss (6 men and 6 women) performed 
first in noise and then in quiet. Complexity level and 
sequence were assigned randomly to Ss within each 
sex group. 


Procedure 


Each S served in one experimental session con- 
sisting of one run in quiet and one in noise. Each 
Tun was 10 min. in duration. The S listened first to 
a 1-min. tape demonstrating the correct method of 
responding to the digits task. He was then given 
two consecutive 1-min. practice sessions on the digits 
task alone. If he failed to respond without error 
during the first min. of practice, the tape was stopped 
and the instructions repeated, All Ss succeeded in 
performing without error during the second min. of 
practice. Next, the RT-task instructions were read, 
followed by the appropriate instructions for the first 
experimental condition, that is, quiet or noise. In 
order to establish the relative Primacy of the RT 
task, S was instructed repeatedly that he should do 
as well as he could on the digits task, but that he 
should not let the digits task interfere with the RT 
task. After completing the first run, the instructions 
for the second condition were read, and S was again 
cautioned not to let the digits task interfere with 
the RT task. The entire session lasted approximately 
45 min. 


Scoring 


Records were kept of RT, errors on the RT task, 
and errors on the digits task. Omissive errors (fail- 
ure to report the occurrence of an odd-even-odd 
Sequence in the 1.5-sec. interval after the comple- 
tion of the sequence) and commissive errors (in- 
appropriate “yes” responses) on the digits task were 
recorded separately. The first 12 RT trials of both 
experimental runs were considered practice trials and 
were not included in the calculation of RT medians 
and error totals. 


RESULTS 


Performance data from both primary and 
secondary tasks were analyzed to determine 
the effects of noise on performance, 


Primary Task 


An analysis of variance ® indicated no sig- 
nificant effect of noise on RT (787 vs. 784 


8 Summaries of analyses of variance and other sum- 
mary tables have been deposited with the American 
Documentation Institute. Order Document No. 9747 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
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TABLE 1 


TOTAL ERRORS ON SECONDARY TASK AS A FUNCTION OF Sequence, Noise, Task Compuxxrry, axp Sex 


Males Females 
Task Sequence Condition Condition 2 
Quiet Noise M Quiet Noise M 
4 Quiet-Noise 7.67 11.83 9.75 10.33 13.67 12,00 10.88 
Simple Noise-Quiet 8.67 11.17 9.92 7.33 10.33 8.83 9.38 
M 8.17 11.50 9.83 8.83 12.00 10.42 10.13 
Quiet-Noise 14.17 21.00 17.58 11.67 18.33 15,00 16.29 
Complex Noise-Quiet 8.83 17.83 13.33 10.33 17.00 13.67 13.50 
M 11.50 19.42 15.46 11.00 17.67 14.33 14.90 
Combined mean for both 9.83 15.46 12.65 9.92 14.83 12,38 
complexity levels 


msec.). There was a tendency for males to 
respond faster than females (766 vs. 806 
msec.), and for RT on the complex task to be 
slower than on the simple task (808 vs. 763 
msec.), but neither of these effects was sig- 
nificant. The only significant factor affecting 
RT was order; F(1/40) = 11.76, p < .01, 
with the first run being slower than the sec- 
ond (805 vs. 766 msec.). No errors were made 
on the primary task during the experimental 
runs. 


Secondary Task 


Table 1 shows the effects of sequence, 
noise, task complexity, and sex on the total 
number of secondary-task errors, that is, 
omissive plus commissive errors. It may be 
seen that mean error totals for all sex-se- 
quence groups were higher in noise than in 
quiet, and that, in general, a greater number 
of errors was made under the complex task 
condition than under the simple condition. An 


ton, D. C. 20540. Remit in advance $1.25 for micro- 
film or $1.25 for photocopies and make checks pay- 
able to: Chief, Photoduplication Service, Library of 
Congress. 

In a separate experiment, 20 Ss performed the 
RT task alone, that is, without the simultaneous 
secondary task. Half of the Ss performed the simple 
version of the task while the other half performed 
the complex version. Performance on the complex 
task was significantly slower than on the simple 
task (708 vs. 587 msec.; t=2.45, p<.05). 


analysis of variance revealed that the main 
effect of noise was significant; F(1/40) = 
62.52, p < .001. The analysis also indicated a 
significant main effect of complexity; 
F (1/40) = 8.31, p < 01. Of major interest, 
however, was a significant Noise X Complex- 
ity interaction; F(1/40)=9.19, p< 01. 
Table 2 summarizes this effect. It will be 
noted that the introduction of noise produced 
a greater increase in errors under the complex 
task condition than under the simple task 
condition. There was no sequence effect to 
complicate the interpretation. 

Separate analyses of variance were per- 
formed for omissive and for commissive er- 
rors on the secondary task. Results of the 
analysis of omissive errors were identical to 
those of the total errors; that is, the effects 
of both noise and task complexity were sig- 


TABLE 2 


EFFECT or NOISE ON SECONDARY-TASK PERFORMANCE 
UNDER SIMPLE AND COMPLEX TASK CONDITIONS 


Mean errors Difference 
(noise 
Task minus 
Quiet Noise N quiet) 
Simple 8.50 11.75 10.13 3.25 
Complex 11.25 18.54 14.90 7.20 
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nificant, and there was a significant Noise X 
Complexity interaction. In the analysis of 
commissive errors, noise was the only signifi- 
cant factor. 


Discussion 


Results of this study demonstrate that noise 
produces a significant decrement in perform- 
ance and that the magnitude of this decre- 
ment varies as a function of task complexity. 
Results also suggest that noise effects may 
have gone undetected in much previous re- 
search for lack of a sensitive criterion mea- 
sure. 

Many tasks demand less than the total per- 
ceptual capacity which an individual has 
available. In these situations, the person pos- 
sesses what may be termed a reserve capacity, 
which is free from the demands of the task. 
Should the perceptual demands of the task or 
of the situation increase, he may draw upon 
this reserve capacity and, by so doing, may be 
able to maintain task performance at a con- 
stant high level. In the present study, the 
fact that increased task complexity, and the 
introduction of noise, did not affect RT per- 
formance is attributed to this compensatory 
mechanism. 

The method of simultaneous tasks provides 
a means of detecting changes in reserve ca- 
pacity associated with changes in the task 
and/or environment. Secondary-task perform- 
ance was used in this study as a measure of 
Teserve perceptual capacity. In this experi- 
ment, the complex primary task made greater 
perceptual demands than the simple primary 
task. Therefore, less reserve capacity was 
available for the secondary task when it was 
paired with the complex task than when it was 
paired with the simple task. This resulted in 
the greater number of errors on the secondary 
task when it was paired with the complex 
primary task. The introduction of noise “used 
up” some of S’s reserve capacity; that 18 8 
had to draw from his reserve so that primary- 
task performance would not suffer as a conse- 
quence of the noise. There was, therefore, 
less perceptual capacity available for sec- 
ondary-task performance, and, consequently, 
errors increased. The fact that the noise pro- 
duced a greater increase in secondary-task 
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errors when the secondary task was paired 
with the complex primary task than when it 
was paired with the simple primary task may 
be attributed to the fact that the noise occu- 
pied proportionally more of the reserve ca- 
pacity in the former case than in the latter. 
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EXPECTATIONS AND DROPOUTS N SCHOOLS OF NURSING 


MILDRED ENGBERG KATZELL? 
National League for Nursing, New York City 


The expectations and experiences of stress and satisfaction of 1,852 Ist-yr. stu- 
dents in 43 schools of nursing were assessed by questionnaire. Low but signifi- 
cant negative correlations were obtained between withdrawal and (a) experi- 
enced satisfactions and (b) confirmation of expectations, especially with respect 
to satisfactions. Withdrawal was unrelated to experienced stress, whether ex- 


pected or unexpected. 


Remarks about the nursing shortage are 
by now commonplace. Since 20-25% of 
entrants to schools of nursing withdraw dur- 
ing the first year, reducing this attrition rate 
would alleviate the shortage. 

Fox, Diamond, and Walsh (1960) re- 
ported on the situations which student nurses 
had indicated as stressful and satisfying, 
including those elements in the school situa- 
tion to which students alleged their with- 
drawal was due. In this report and elsewhere, 
students who disliked nursing indicated that 
it was not what they had expected. 

In addition, Kibrick ( 1958) observed that 
the satisfactions expected by nursing students 
did not have much bearing on their remain- 
ing in school, but that those who were more 
realistic in their appraisal of nursing were 
more likely to remain. This finding is con- 
sistent with that of Weitz (1956) who dis- 
covered that life insurance agents were more 
likely to remain in their jobs if they had a 
clear picture of the job duties. Block, Yuker, 
Campbell, and Melvin (1964) and Brown 
and Landsberger (1960) found that dis- 
satisfaction is influential in determining 
whether an individual will withdraw during 
the early stages of employment. Block and 
his colleagues added that the longer the 
worker remains with the company, the less 
important satisfaction becomes in determin- 
ing his further continuation. This may also 
be true of student nurses, among whom the 
withdrawal rate is greatest during the first 
year, and declines markedly thereafter. 


1 This article is based upon data collected by the 
author in fulfilling the dissertation requirements for 
the PhD degree at Teachers College, Columbia 
University. 


Among the theories that have dealt with 
withdrawal from group participation are 
those of Ross and Zander (1957), March and 
Simon (1958), Herzberg, Mausner, and 
Snyderman (1959), Katzell (1964), and 
Vroom (1964). 

Degree of need satisfaction was hypoth- 
esized by Ross and Zander to be related to 
continued participation in the employment 
situation. Herzberg et al., on the other hand, 
classified the elements of the work situation 
into satisfiers or motivating factors, and dis- 
satisfiers or hygiene factors. Turnover, in 
this theory, was viewed as being related to a 
low level of hygiene factors, causing dis- 
satisfaction, whereas the motivating factors, 
or satisfiers, were perceived as being related 
primarily to productivity. 

In Katzell’s view, withdrawal is directly 
related to the experienced or expected dis- 
satisfactions associated with the situation 
and the satisfactions associated with an 
alternative situation. He also theorized that 
withdrawal is negatively related to the experi- 
enced or expected satisfactions associated 
with the situation and to the dissatisfactions 
associated with an alternative situation. 
Vroom postulated that the more satisfied the 
individual, the greater the force on him to 
remain in the situation and the less the 
probability of his voluntarily withdrawing 
from it. He also indicated that decisions to 
remain or withdraw are related to the positive 
and negative valences of outcomes and to 
the expectancies that the outcomes will be 
attained. 

March and Simon specified that continued 
participation depends upon the difference be- 
tween rewards and contributions, and sug- 
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gested that the diference be estimated from 
measures of satisfaction, since the greater 
the excess of rewards over contributions, the 
greater should be satisfaction. Satisfaction 
was seen as being related to, among other 
factors, predictability in terms of the degree 
to which expectations are realized. 


Osyectives 


This study was undertaken as a partial 
test of the March and Simon theory, as it 
applies to withdrawal from schools of nurs- 
ing. From the theory, it should follow that 
the greater the number of satisfactions ex- 
perienced, the more likely a student will be 
to continue in a school of nursing. Conversely, 
the greater the number of stresses experi- 
enced, the more likely a student will be to 
withdraw. In addition, a student will be 
more likely to withdraw if she experiences 
a large number of unexpected stresses than 
if she experiences the same number of stresses 
but they were expected. It should also be 
true that expected but unrealized satisfac- 
tions will tend to cause withdrawal, and that 
experiencing expected satisfactions will do 
more to prevent withdrawal than experiencing 
unexpected satisfactions. 


METHOD 
Sample 


The sample comprised 1,852 first-year students in 
43 schools of nursing. Among the 1,852 students, 
1,439 (77.7%) returned for the second year; 183 
(9.9%) withdrew for academic reasons; 250 (124%) 
withdrew for nonacademic reasons. 


Instruments 


A 139-item questionnaire was constructed tof 
measure the satisfactions (56 items) and stresses 
(83 items) expected and by the students. 
A single set of items was used, with one form of 
instructions being given when the students’ ex- 
pectations were collected, and another when ex- 
periences were collected. 

Illustrative of the questionnaire items are the 
following 10, of which the first 5 were classified as 
stress items, and the remainder as satisfaction items. 

1. Being homesick. 7 
. Being respected less as a student nu 
people outside the school, than if one had 
pursued another career. R 
3. Spending very little time working with pa- 

tients. 

4. Doubting the wisdom of the choice of nursing 
as a career. 
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Fourteen scores were derived 
the pair of questionnaires * 


: 
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Procedure 


During the frst week they were in school, the 
students completed the Expectations Questionnaire. 


after the students had entered the schools of nursing, 
a questionnaire was sent to each cooperating school 
to karn which students had withdrawn and the 
reason for withdrawal of each. 

The 14 scores derived from the questionnaires 
were correlated, using point-biserial coefficients, 
with (e) survival-withdrawal, (6) survival-academic 
withdrawal, and (c) survival-nonacademic with- 
drawal. 


Resvutts 


The scores of the 1,439 survivors and the 
three categories of dropouts are compared 
in Table 1. Differences significant at the 
l level were found between the mean scores 
of the survivors and each of the dropout 
groups in the total number of satisfactions 
experienced, the number of expected satis- 
factions experienced, and the number of ex- 
pected satisfactions which were not experi- 
enced. In each case, those who remained in 
school had experienced more satisfactions. 
Although survivors and dropouts did not differ 
in numbers of expected satisfactions and 
stresses, the survivors had significantly more 
of their expectations confirmed than did the 
dropouts. 

Survivors experienced more 
satisfactions than did dropouts, the difference 
being significant in the academic and total 
dropout groups. The academic dropouts ex- 
perienced significantly more unexpected 

2 Copies of the questionnaires and data relative to 
their construction may be obtained from the author 
on request, 
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TABLE 1 


Scores or 1,439 Survivors, 413 Dropouts, 183 Acapemic Drorovrs, AND 
230 Nowacapeaic Dropouts 


Satisfactions expected 

Stresses expected 

Satisfactions experienced 

Stresses experienced 

Expected satisfactions which were 
experienced 

Expected satisfactions not 
experienced 

Unexpected satisfactions which were 
experienced 

Unexpected satisfactions not 

perienced 


x 
Expected stresses which were 
rienced 


experi 
Expected stresses not experienced 
Unexpected stresses which were 
ex 


. stresses not experienced 
‘onfirmed expectations 
Unconfirmed 


Stresses than either the survivors or the 
nonacademic dropouts, 
nificantly fewer stresses than the nonacademic 
dropouts. 

The nonacademic and total dropout groups 
exceeded survivors in numbers of satisfac- 
tions which were neither expected nor ex- 


$ ratios 
— a 
A-B | A-C | A-D | c-p 
91 -57 1.76 1.71 
1 1.13 1.72 2.119 
7.85%% | 5.47** 6.13. 12 
32 68 1s Ot 


2.80% 3.41127 1.40 

2.60% 77 2.96%% | 1.47 
16 70 39 83 

1.24 1.11 2.67% 2.81 


82 |2.06* | 91 | 2/338 
148 | 55 |1.60 | 67 
5.433.514.0724 
4.53% 2.54% 37% 10 
2.17% |1.96* 10 92 


perienced. Nonacademic dropouts also ex- 
ceeded academic dropouts and survivors in 
the number of stresses expected but not ex- 
perienced. The survivors omitted fewer items 
than the dropouts. 

The point-biserial correlations between 
withdrawal and each of the 14 questionnaire 


All survivors and 


All survivors and All survivors and 


Score all dropouts academic dropouts 1 
W = 1,852) (N = 1,622) 07 1.60) 
Satisfactions expected +.022 +.015 +.045 
Stresses expected +.012 —.029 +.043 
Satisfactions experienced PAUA a —.148** —.164** 
Stresses experienced +.008 +.018 +.004 
Expected satisfactions which were experienced root =.105* =.143"* 
Expected satisfactions not experienced +.187** +.165** +.143** 
Unexpected satisfactions which were experienced —.067* AU ihe —.034 
Unexpected satisfactions not experienced +.063* +.021 +.077* 
Expected stresses which were experienced —.004 —.018 +.010 
Expected stresses not experienced +.030 +.029 +.070* 
Unexpected stresses which were experienced +.019 +.056 —.021 
Unexpected stresses not experienced —.035 —.014 —.040 
Confirmed expectations —.129** —.104** —.106** 
Unconfirmed expectations +.112** +.084** +.097** 
* 
"p S01. 
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scores are presented in Table 2. In each 
group, 5 of the 14 coefficients, though small, 
are significant at the 01 level. The 5 all 
pertain to satisfactions which were or were 
not experienced and to confirmation of ex- 
pectations. 

No relationship was demonstrated in any 
of the groups between withdrawal and (a) 
expected satisfactions, (6) expected stresses, 
(c) experienced stresses, (d) expected stresses 
which were experienced, (e) unexpected 
stresses which were experienced, and (/) 
unexpected stresses which were not experi- 
enced. 


Discussion 


The results of this study supported March 
and Simon’s (1958) postulates relative to 
satisfaction, but gave no support in relation 
to stress. The predicted effect of expectations 
was observed in the area of satisfactions, but 
received uncertain support in regard to 
stresses. 

Contrary to Herzberg et al.’s (1959) view, 
it did not appear that student withdrawal 
was primarily due to dissatisfiers. It did, 
however, seem clear that there were dif- 
ferences between satisfiers and dissatisfiers, 


as Herzberg indicated, although no attempt 


was made here to validate Herzberg’s 
measures. 

The measures used in this study also did 
not lend themselves to direct implications 
for Ross and Zander’s (1957) position. It 
would, however, be anticipated from their 
theory that dropouts, more than survivors, 
would report interference of school with their 
pursuit of extramural satisfactions, but this 
was not found to be the case. 

The negative relationship between satisfac- 
tion and withdrawal, posited by. Katzell 
(1964) and by Vroom (1964), received sup- 
port from these findings. Their respective 
views concerning the relation of dissatisfac- 
tion, or negative valence, to withdrawal were 
not supported. The absence of stress did not 
have the favorable consequences associated 
with satisfaction, although the absence of 
satisfaction tended to have negative con- 
sequences, especially when the satisfaction 
was expected. 


Despite the fact that significant correla- 
tions were obtained, so small a fraction of 


volved, the saving of even a small percentage 
of those who would otherwise withdraw from 
schools of nursing could contribute sub- 
stantially in absolute numbers to the allevia- 
tion of the nursing shortage. 


Conciustons 


1. Withdrawal was inversely related to 


experienced satisfaction. 
2. Withdrawal was inversely related to the 


confirmation of expectations, especially in the 
area of satisfactions, 

3. Withdrawal was unrelated to experienced 
stresses, whether expected or unexpected. 

4. Stress and satisfaction did not operate 
as opposite ends of the same continuum. 
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RELATION BETWEEN ETHNIC ORIGIN AND GSR 


REACTIVITY IN PSYCHOPHYSIOLOGICAL 
DETECTION 


SOL KUGELMASS ano ISRAEL LIEBLICH 
Hebrew University of Jerusalem, Jerusalem, Israel 
The psychophysiological data of 3 Jewish and 1 Bedouin samples were ana- 


lyzed in connection with the detectability of a previously chosen card. Ss of 
Near Eastern origin tended to show lower pulse rate, higher basic skin con- 


ductance, and lower relevant GSR reactivity. It is Proposed that psycho- 
cultural differences are likely to be the explanation for the consistent differen- 
tial GSR reactivity to the relevant stimuli. 


Attention has recently been drawn to the 
significant differences in skin conductance 
found in different ethnic and racial groups. 
Johnson and Corah (1963) reported that 
Negro subjects (Ss) showed lower skin-con- 
ductance values than did Caucasian Ss. This 
finding was supported by further data from 
Bernstein (1965), Johnson and Landon 
(1965), and Malmo (1965). A study by 
Lazarus, Tomita, Opton, and Kodama (1966) 
compared the skin conductance of Japanese 
and American students as they watched a 
stressful film. These data suggest that the 
Japanese may have higher skin conductance. 
More limited data reported by Wenger in 
Johnson and Landon (1965) as well as by 
Malmo (1965) suggest that oriental Ss may 
have dower skin conductance than Caucasian 
Ss. 

Less interest has been focused on analogous 
differences in GSR reactivity. Sternbach and 
Tursky (1964) reported skin-potential dif- 
ferences related to ethnic background during 
the use of electric shock stimulation in con- 
nection with pain-threshold determination. 
Johnson and Landon (1965) reported racial 
differences in GSR reactivity to the initial 
presentation of a tone stimulus, but this 
difference became insignificant upon correc- 
tion for prestimulus base-line conductance. 
Lazarus et al. (1966) did not analyze their 
data on a time scale comparable to the usual 
GSR analysis, but did report that “Unlike 


1 The research reported in this paper has been 
sponsored by the Air Force Office of Scientific Re- 
search, through the European Office, Aerospace 
Research, United States Air Force, under Contract 
AF 61 (052)-839. 


American samples, the Japanese did not show 
marked stimulus-controlled variation in skin 
conductance during the stress film. [p. 
631]. 

In this laboratory, differences in GSR re- 
activity were found to exist between Jewish 
Ss of different ethnic backgrounds (Kugel- 
mass, 1963; Kugelmass & Lieblich, 1966). 
Jewish Ss of Near Eastern origin (born in 
the Moslem countries of the Mediterranean 
basin, Morocco, Yemen, Iraq, or Iran) 
showed less GSR reactivity than Jewish Ss 
born in America, Europe, or Israel. These 
differences in GSR reactivity were observed 
in connection with the response to the name 
of the relevant chosen card during the sub- 
sequent interrogation part of a card-detection 
experiment. The purpose of the present report 
is to present supplementary analyses of the 
above-mentioned data as well as to report 
new data derived from independent samples. 


MertHop 
Original Sample 


This sample has been previously described in 
Kugelmass (1963) and Kugelmass and Lieblich 
(1966). The Middle Eastern Ss exhibited less GSR 
reactivity in the three different stress conditions, 
reaching statistical significance in one of them. In 
addition, 8 of the 36 Ss turned out to be non- 
reactors in all three experimental tests. All eight 
of these total nonreactors were of Near Eastern 
origin, and no Western § showed this pattern. It 
was of interest that no significant difference in 
GSR reactivity was observed in the responses to 
the other irrelevant cards, 

Base-line data. (a) Conductance: Values of skin 
resistance (ohms) were taken 30, 60, 90, and 120 
sec. after the beginning of the first 3-min. rest 
period which opened each experimental examination. 
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These values were converted to conductance in 
micromhos, and averaged for each S, and ibea 
averaged over Ss. The mean conductance value lor 
the Near Eastern Ss was 19.74 micromhos, and 
1738 micromhos for the Western group. The dii- 
ference was not statistically significant at the 05 
level (t = 1.26 for 13 Western Ss and 25 Near 
Eastern Ss). Examination of the data of the sub- 
group of total nonreactors showed a mean basic 
conductance of 2240 micromhos which is signifi. 
cantly higher (t = 4.10, with a combined variance 
estimate of 740, a < 01). 

(b) Pulse rate: Five-second samples of pulse-rate 
data were taken at 30, 60, 90, and 120 sec. after 
the beginning of the first 3-min. rest period. The 
mean pulse rate of the Near Eastern Ss was 8049 
beats/min, while the rate for the Western Ss was 
3645 beats/min. With a combined variance esti- 
mate of 145.36, this difference between means is 
not significant at the OS level (f = 143). 

It should be added that no significant difference 
in years of education or intelligence stanine scores 
was found to exist between the two groups of 
Ss who had a mean number of 6.30 yr. of education 
(S = 1.75). 


Blood-Pressure Cuff Sample 


This sample was also described in Kugelmass 
and Lieblich (1966). The 40 Ss of this additional 
sample participated in an experiment in which GSR 
readings during card detection were made both 
with and without an accompanying inflation of a 
blood-pressure measurement cuff. The data to be 
reported here are all from the GSR condition which 
did not include this additional variance and thus 
are more comparable to the rest of the data to be 
presented. 

GSR differential reactivity date. The mean re- 
actions to the “hot” relevant chosen card were 
analyzed as in the original sample, and Ss were 
categorized as reactors if their mean response was 
at least 1,000 ohms. The relation between ethnic 
background and this reactivity may be seen in 
Table 1. 

A chi-square analysis of the data indicates that 
a significantly higher proportion of Near Eastern 
Ss are nonreactors, X? = 3.30 (æ < 05, one-tailed). 
Once again no significant difference was found in 
GSR reactivity to the irrelevant cards. 


TABLE 1 


RELATION BETWEEN ETHNIC ORIGIN AND GSR 
Reactivity To THE “Hot” CARD 


Ethnic origin 
Western 10 5 
Near Eastern 8 

Total 18 40 


Note. R = reactor; NR = nonreactor. 


Daw-line data. (s) Conductance: Condectance 
values were obtained as in the original sample. The 
mean thin-conductance value of the 14 Near Eastern 
Ss was 17,92 with 18.16 micromhos as a mean for the 
16 Westerners. The difference between thee two 
means with a combined variance estimate of 7.90 
was not significant at the OS level (f = 0.50), 

(b) No pube-rate data had been collected ia 
this experiment. 

There was no significant difference in education 
or Intelligence stanine between the Near Eastern and 
Western parts of the sample which had a mean 
number of 9.30 yr. of education (S = 2.06). 


Criminal-Suspect Sample 


As a follow-up to the two experiments described 
above a sample of criminal suspects was examined 
at the central laboratory of the Israel Police Force. 
The standard card- teu procedure used in the above 
experiments was incorporated into the regular 
police Interrogation procedure of 62 criminal suspects 
connected with serious offences, The only other dif- 
ference in procedure derives from using the police 
apparatus which was a standard three-channel 
Stoelting polygraph. This limited the analysis in 
that no estimate of skin conductance was obtained 
and changes in millimeter deflections serve as an 
indication of GSR reactivity. 

GSR differential reactivity data. GSR responses 
were defined as the maximal change in millimeters 
from base line within 8 sec. of the presentation of 
the number. Mean responses to each of the cards 
were calculated for the Near Eastern Ss (N = 39) 
and the Western Ss (N = 19). The records of four 
of the Ss were technically inadequate. The mean 
response of the Near Eastern Ss to the relevant card 
was 12.51 mm. while for the Western Ss it was 
21.47. With a combined variance estimate of 303.11 
the difference between means is significant at the 
os level (t = 1.82, one-tailed). No significant differ- 
ence was found in GSR reactivity to the irrelevant 
cards (Near Eastern, X = 7.68, Western, X =7.56, 
combined S = 28.91). 

Base-line data. (a) This Stoelting model does not 
provide for measuring basic conductance. 

(b) Pulse-rate data: The pulse-rate data were 
obtained by counting the spikes in the blood- 
pressure cuff recording which is a standard feature 
of this apparatus. Five-second samples of pulse- 
rate data were measured at 30, 60, 90, and 120 sec. 
after the beginning of the card-test interrogation 
during the use of the blood-pressure cuff. The mean 
pulse rate of the Near Eastern Ss was 95.99 beats/ 
min, while the rate for the Westerners was 101.90. 
With a combined variance estimate of 364.07, this 
difference between the means is not significant at 
a=.05 (t = 1.13, 39 Near Easterners, 20 Western- 
ers). 


2The authors wish to thank the Israel Police 
Force and especially M. Kaplan, A. Ben-Ishai, and 
A. Opatowski for their help and cooperation. They 
also wish to thank G. Ben-Shakhar for expert 
assistance. 


160 Sor Kucetmass AND ISRAEL LIEBLICH 


Bedouin Sample 


Following the previously mentioned experiment 
which showed the same general trends, it was 
decided to obtain a sample of Ss who approximate 
the phenotype of the Near Eastern natives as 
closely as possible. Such Ss were available in 
Bedouin Moslem tribes which dwell in the Israeli 
Negev desert. A detailed description of these 
Bedouin tribes may be found in Groen, Balogh, 
Levy, and Yaron (1964). It has been noted that 
there exists a rough gradient of decreasing modern 
influence on Bedouin culture in moving from the 
northern Negev to the southern part where few 
Jewish settlements exist as yet.3 As a first step it 
was decided to draw a sample which would be at 
least roughly comparable to the previous samples 
in having some minimum education in a formal 
setting. 

Subjects. Ten male Bedouin Ss of the Arab El 
Houzaiel tribe who live close to the Jewish Kibbutz 
settlement Shuval in the Northern Negev were 
tested. All of these tribesmen were engaged in 
agricultural work. The mean level of formal educa- 
tion was 5.8 yr. of school. 

Procedure, The Ss were tested with the Stoelting 
apparatus obtained from the Police Laboratory as 
well as with the constant-current (20 ya.) battery- 
operated GSR apparatus used in the original and 
blood-pressure cuff experiments described above, 
which employs Beckman electrodes, on the volar 
surface of the first and fourth fingers of the left 
hand. Sanborn paste is used to ensure standard con- 
tact. The testing took place in the main room of 
Sheik Souleiman EI Houzaiel on a cool January day 
(17° C). The Ss were tested using the two apparatus 
setups in a rotated order. The standard card test 
(Kugelmass & Lieblich, 1966) was administered in 
Arabic through the use of an Israeli police interpre- 
ter. The research had been presented to the Ss as 
part of a research effort of the Hebrew University to 
investigate physiological responses of different ethnic 
groups in Israel. 

GSR differential reactivity data. All of the 10 
Bedouin Ss were found to be nonreactors by the 
criterion previously defined. The records tended to 
be generally “flat” and even the mean response to 
the hot card was only 187.50 ohms. Because of this, 
no significant card detection was possible. As an- 
other means of examining GSR reactivity it seemed 
worthwhile to check the “orienting response” of 
the Ss to the first buffer item that is used to open 
all interrogation sequences. The mean orienting re- 
sponse of the Bedouin Ss was 1000.00 ohms, that of 
the Near Eastern of the original sample (most 
comparable in level of formal education) was 
1843.48 ohms, and of the Western Ss was 1730.00 
ohms. Neither the difference between the Bedouin 
and the Westerners (combined variance estimate 
2,188,812, £= 1.04) nor the difference between the 


Lt. Col. Sasson Bar-Zvi, former military gover- 


nor of the Negev region, personal communication, 
1966. 


Bedouin and the Near Easterners (3,974,465, t= 
1.04) was significant at the Os level. 

Base-line data. (a) Conductance: The average 
basic skin conductance of the 10 Bedouin Ss was 
found to be 31.39 micromhos. With a combined 
variance estimate of 79.55, the difference between 
this mean and the mean of the Jewish Near Eastern 
Ss, 19.74, is significant at the .01 level (t= 3.50). 
The Western mean was even lower, 17.38, and 
with a combined variance estimate of 109.47 is 
also significantly lower at the .01 level (f 3.11). 

(5) Pulse rate: Pulse-rate data were obtained 
from the Stoelting records as described in the 
criminal-suspect sample. The mean pulse rate of 
the 10 Bedouin Ss was 69.50 beats/min. This is 
significantly lower than the Near Eastern mean, 
80.49 (a C05, f 2.42, with a combined variance 
estimate of 155.78), and the Western mean, 86.45 
(a < 01, t= 3.08, with a combined variance esti- 
mate of 205.87). 


Discussion 


The ethnic differences that caught the 
authors’ attention in the first two experiments 
were primarily concerned with a particular 
kind of GSR reactivity to the relevant card. 
Following the results described above it 
seemed important to consider the more sus- 
tained base-line state of the organism in ad- 
dition to the relatively transient reactions 
of the S to specific stimuli. 

The data indicate that there may be a 
tendency for the Near Eastern Ss to have 
higher skin conductance. This is clearly seen 
in the Bedouin sample. Johnson and Corah 
(1963) considered two aspects of the skin 
in connection with the racial differences they 
found. They did not consider the skin color 
per se as a likely explanation, and Johnson 
and Landon (1965) were not able to find 
significant differences in the number of active 
eccrine sweat glands. While Malmo’s (1965) 
more limited data do suggest a negative cor- 
relation between skin conductance and pig- 
ment concentration, the present data are not 
in accord with this suggestion since Near 
Eastern Jewish and Bedouin Ss are darker 
on the average than Western Jews. Another 
possibility suggested by Johnson and Corah 
(1963) related lower conductance to a 
thicker stratum corneum. Superficially, at 
least, the present data are not in accord with 
this position. The Bedouin group with by 
far higher skin conductance is the only group 
of manual workers who might be expected to 
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have a thick, calloused, stratum corneum. 
Of course a more precise test of this issue is 
necessary. Of possible greater relevance to 
the present data is a finding mentioned by 
Johnson and Landon (1965): K. M. Taylor 
found that “Oriental Ss had greater chloride 
concentration than Caucasian Ss [p. 328].” 
The higher conductance finding in the 
Bedouin may reflect adaptation to some 
aspect of the immediate environment, which 
involves the amount of sweat secreted, its 
chemical composition, or both. 

It is also possible to view these conductance 
differences as reflecting a difference in basic 
activation. An examination of the pulse-rate 
data which is another index of activation sug- 
gests caution in such an approach. The pulse- 
rate data would actually suggest the opposite 
since the Bedouin Ss had a significantly lower 
pulse rate and there is some tendency toward 
the same in the Near Eastern Jews. It is, of 
course, difficult to evaluate the comparative 
psychological states of the Ss during the 
examinations, but it does seem plausible to 
assume much less familiarity and possibly 
greater stress for the Bedouin Ss. On the 
other hand, medical research has focused on 
the lower risk of cardiovascular disease ob- 
served in Yemenite Jews and Israeli Bedouin 
(Groen et al., 1964). This might reflect a 
low level of cardiovascular response to stress 
states. Another way of describing these find- 
ings would be to extend the notion of “re- 
sponse stereotypy” used in connection with an 
individual’s psychophysiological responses 
over situations (Lacey & Lacey, 1958) to 
ethnic-group response stereotypy- The rela- 
tively low variance found in the Bedouin 
records might even suggest a genetic mechan- 
ism for such a notion. 

GSR reactivity should be considered in 
the light of the tendency to find higher base- 
line skin conductance in the Near Eastern 
Ss. All three studies dealing with different 
Jewish samples indicated significant ethnic 
differences in reactivity to the “relevant” 
card as opposed to little or no difference to 
the irrelevant card. In the case of the 
Bedouin, all GSR reactivity to the stimuli 
was relatively low although an appreciable 
“orienting” reaction was elicited. Previous 
analysis had suggested some relationship be- 


TABLE 2 

CORRELATIONS BETWEEN Basic CONDUCTANCE AND 
BOTH RELEVANT AND IRRELEVANT 

GSR Reactiviry 


Base line 
(conductance) | Base line and 
and cold card “hot” card 
(ohms) 
Original sample 
(N = 36) —0.363° —0.370° 
Blood-pressure cuff 
(N = 40) —0.155 —0.245 
*a <05. i 


tween relevant-card reactivity and base-line 
skin conductance when considering a special 
nonreactive subgroup of the Near Easterners 
as compared with the Westerners (Kugelmass, 
1963). Correlations were therefore calculated 
for the relationship between base-line skin 
conductance and both relevant- and irrele- 
vant-card GSR reactivity for the original and 
the blood-pressure cuf samples, and pre- 
sented in Table 2. 

All of these Pearson correlations were 
negative and low, reaching significance only 
in the original sample. This direction is what 
may have been expected from a traditional 
conception of the Law of Initial Values. The 
recent findings of Hord, Johnson, and Lubin 
(1964) as well as Johnson and Landon (1965) 
obtained significant correlations in the op- 
posite direction, when using conductance 
changes in micromhos. In the light of their 
findings it is difficult to conceive of an index 


TABLE 3 


MEAN CONDUCTANCE CHANGES IN MICROMHOS AND 
VARIANCE ESTIMATES TO THE RELEVANT CARDS 
IN THE DIFFERENT SAMPLES 


Middle Eastern Western 
Sample sample sample 
Original 
* 523 630 
8 057 024 
Blood-pressure cuff 
xX .246 371 
S: .054 050 
Bedouins 
be .216 = 
Ss Ail — 
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that completely frees the measure of re- 
activity from the tonic base-line conductance 
within the GSR channel. Nevertheless, con- 
ductance change in micromhos was calcu- 
lated for the relevant-card data of the origi- 
nal, blood-pressure, and Bedouin samples and 
is presented in Table 3. The same pattern of 
differences in reactivity appears when using 
this transformation. 

Given the above findings it is difficult to 
conceive of the base-line condition being an 
important factor in explaining the consistent 
differential GSR reactivity pattern that 
emerged in all three Jewish samples. There is 
some possibility that the influence of the 
base line comes into operation only after 
surpassing a minimal level of reactivity which 
is more likely in the greater response to the 
relevant card. This should produce a higher 
negative correlation between relevant- card 
reactivity and base-line conductance than for 
the irrelevant - card reactivity. The correla- 
tions are too low to supply a clear test of 
this notion, but this should be considered 
further. On the whole, the data would sug- 
gest that this would only be a minor source 
for the differential effect, which would ap- 
pear to be a result of psychological factors. 

It is possible to conceive of a range of 
psychological variables that might bring 
about such a differential reaction. As a con- 
servative position it seems best to consider 
all possible factors that produce GSR re- 
sponses such as shifts of focus of attention, 
concentration, effort, etc. rather than limit 
theorizing to the specific lie-detection aspects 
of the experimental situation. This might be 
differentially activated by any of the demand 
characteristics of the card-detection task 
which might vary in meaning for each in- 
dividual and probably systematically for 
groups of different cultural background. A 
study from this laboratory has recently man- 
ipulated this differential GSR reactivity by 
variations in relevance of stimuli to the 
person (Lieblich, 1966). 

In summary, there appear to be at least 
two issues to be considered in relating ethnic 
differences to psychophysiological reactivity 
which should be considered further. The 
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present data support the notion of consistent 
differences in basic activation as indexed by 
skin conductance and possibly pulse rate 
which do not go in the same theoretically 
expected direction. Possibly related to these 
base-line differences are consistent differences 
in short-term GSR reactivity which appear 
to be more closely related to psychological 
factors. 
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EFFECT OF PERCENTAGE AND SPECIFICITY OF FEEDBACK 
ON CHOICE BEHAVIOR IN A PROBABILISTIC 
INFORMATION-PROCESSING TASK * 


IRWIN L. GOLDSTEIN? JOSEPH T. EMANUEL, ao WILLIAM C. HOWELL 
Ohio State University 


Proficiency in choosing the better of 2 sources of information was studied as 
a function of J variables: percentage feedback, specificity of feedback, and 
difficulty of discrimination between sources. 210 Ss performed 200 trlah under 
1 of 21 conditions, The information pool consisted of 2 ditinct distributions 
of numbers from which random selections were made and dleplayed as read- 
ings on a pair of meters. Only 1 reading was shown per trial, and 8 judged 
whether it or the other meter reading (not shown) was higher. Chokes could 
thus be based upon both the momentary state of 1 meter and familiarity with 
the distributions from which information was drawn. Contrary to typical 
findings, percentage feedback had no influence on acquisition or performance; 
paradoxically, a specificity effect appeared when feedback was provided. 


The term knowledge of results (KR) refers 
to information provided for an individual or 
group of individuals concerning the outcome 
of a response or group of responses. Research 
on a variety of tasks including the ergograph 
(Arps, 1917), graded handwriting (Gilliland, 
1925), and complex arm-leg coordination 
(Lindahl, 1945) has demonstrated that per- 
formance with KR is superior to performance 
without KR. 

In many realistic settings, however, such as 
complex military and industrial systems, it is 
difficult or impossible to provide complete or 
accurate KR after each response. An issue of 
some practical importance, therefore, is how 
often—or on what percentage of responses— 
feedback must be provided in order to main- 
tain its facilitating influence. Research ad- 
dressed to this problem has yielded ambiguous 
results. Kimble (1961), for example, sum- 
marizes the findings of several studies con- 
cerned directly with percentage of feedback 
by stating that performance “. . . is usually, 
but not always poorer... Ip. 161]” for 
partial than for 100% conditions. On the 

1The research reported in this paper was carried 
out at the Human Performance Center and was 
sponsored by the Aerospace Medical Research Lab- 
oratories, Aerospace Medical Division, Air Force 
Systems Command, Wright-Patterson Air Force 
Base, Ohio, under Contract No. AF 33(615)-2248 
with the Ohio State University Research Founda- 
tion. Further reproduction is authorized to satisfy 
the needs of the United States Government. 

2 Now at the University of Maryland. 


other hand, other investigators have reported 
that performance is not seriously impaired on 
simple conditioning and reaction-time tasks 
until reinforcement or feedback is withheld on 
70-80% of the trials (McCormack, Bending, 
& McElheran, 1963; McCormack & McEl- 
heran, 1963). Goldstein, Southard, and Schum 
(1967) obtained a similar result using a com- 
plex decision-making task; again, no decre- 
ment appeared until KR was withheld on 
67% of the decision responses. 

Part of the confusion with respect to effi- 
cacy of KR may stem from the fact that it is 
likely to affect more than one organismic 
process. Bilodeau (1966), for example, dis- 
tinguishes three potential functions which 
might be involved—to one or another degree 
—in any specific instance of KR: a motivat- 
ing function, a reinforcing function, and a 
directive function. Each of these processes 
might be affected somewhat differently by 
withholding feedback on some proportion of 
trials. It could be, for example, that the per- 
centage required to maintain a high level of 
motivation is fixed and relatively small, while 
that necessary for proper direction varies with 
such factors as task difficulty, specificity of 
instructions, and availability of other sources 
of guidance. 

While it is difficult, if not impossible, to 
isolate one of these KR functions completely 
in any experimental design, the primary em- 
phasis of the present study was upon the 
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directive or injormational facet, Thus, the 
importance of percentage of KR was examined 
in light of several other variables which might 
be expected to control the informational value 
of feedback provided. These variables were 
task difficulty and specificity of KR. 

The relevance of task difficulty to the per- 
centage of KR issue is rather obvious. If the 
information in KR is of any value for response 
guidance, it should become increasingly im- 
portant to the subject (S) as his course be- 
comes less clear. Consequently, percentage of 
ſeedback should be of more significance as 
difficulty is increased. 

On a purely logical basis, specificity of KR 
also seems to bear directly upon the informa- 
tional value of feedback; more specific KR 
should provide more precise guidance of be- 
havior. Actually, however, the relationship is 
not nearly so simple. Evidence to date sug- 
gests that people are limited in their capacity 
to use highly specific feedback information 
(Eriksen, 1958; Goldstein et al., 1967); while 
some specificity may be beneficial, further 
increases in detail may actually degrade per- 
formance (Annett, 1961; Crafts & Gilbert, 
1935). Since it is thus difficult to anticipate, 
for any particular set of conditions, how well 
specific KR will be utilized, it is virtually im- 
possible to predict the way in which speci- 
ficity should interact with percentage of KR. 
Nevertheless, if both factors appear orthog- 
onally in a design, it should be possible to 
determine whether the specific information is 
used and, if so, whether it increases in useful- 
ness as it occurs on a greater percentage of 
trials. This was the approach taken in the 
present study; two distinct levels of speci- 
ficity were combined factorially with four per- 
centages of KR and three levels of difficulty. 

A task originally developed by Howell and 
Funaro (1965) provided a useful vehicle for 
the study of KR variables. In this task, the 
display consisted of two meters (A and B) 
upon which could be presented sample num- 
bers (readings) drawn from two distinct 
number populations. On each trial a sample 
number was drawn from both A and B, but 
only one of these was displayed. The § was 
required to guess whether or not it was a 
higher number than that sampled from the 
other (not shown) population. These stimulus 
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readings were displayed alternately on the 
two meters. Thus, as more and more stimulus 
readings were presented, S could gain increa- 
sing familiarity with the distributions of num- 
bers from which both the A and B readings 
were drawn. He could gain even more in- 
formation about these distributions, however, 
to the extent that KR was provided and he 
was able to use it. Difficulty of discrimina- 
tion could easily be varied by manipulating 
the differences between the A and B popula- 
tions. 

The Howell and Funaro (1965) study in- 
dicated that & comes to establish certain 
numbers as cutoff points for each stimulus dis- 
tribution such that whenever a reading ap- 
pears which is below that cutoff point, his 
choice is the alternate meter. The present 
investigation examined the effects of percent- 
age of KR (0%, 33%, 67%, 100%) and 
specificity of KR on $’s choice of cutoff points 
for three levels of input distribution discrimin- 
ability. Specificity of KR was included at two 
levels by either permitting S to see both the 
A and B readings after his response to one 
of them (specific feedback) or by providing 
him simply with right-wrong information 
(nonspecific feedback). 


METHOD 


Subjects. The Ss were 210 male undergraduate 
students enrolled in an introductory psychology 
course, They were divided at random into 21 groups 
of 10 each. Each S served for 200 trials. 

Apparatus. The S was seated at a table in front 
of the display and response keys. The room was 
completely separated from the experimenter (E). 
This display consisted of a pair of microammeters 
with scale markers reading from 0 to 200 microamps 
in units of 5. The meter on the left was labeled 
“A” and the one on the right B.“ The E was 
able to adjust and present the readings on either 
dial by adjusting potentiometers. A pair of switches 
located beneath each dial was used by S in making 
his choice of dials. The response activated an 
A or B light in E's booth and the choice was then 
recorded manually. Another set of switches permitted 
E to present feedback by activating either a green 
light to signify a correct choice, a red light to 
signify an incorrect choice, or a combination of the 
appropriate light and the reading on the alternate 
meter (ie., whichever one was not used to present 
the stimulus on that trial). 

Independent variables. The independent variables 
investigated in this study were: (a) specificity of 
feedback, (b) percentage of feedback, and (c) dif- 
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the draws; for the 3-unit separation the population 
with the larger mean was higher on 67% of the 
draws; and for the S-unit separation it was higher 
on 78% of the draws. The readings (numbers) to 
be used on any trial were drawn randomly and 
independently from the two populations; one num- 
ber from each population (A and B) constituted a 
trial. One restriction placed on the sampling pro- 
cedure was that there be no tied readings; 
ever this occurred, that pair was discarded and a 
new pair was drawn. The sampling procedure was 
carried out by a computer program which also 
supplied data sheets with all pairs of readings 
— i E's use in manual presentation 
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of the 
gS. 
Procedure. The S was seated in front of the dis- 
play and corresponding response keys. He was 
structed that the two meters represented two distinct 
normal distributions and that each trial consisted of 
a random draw of two numbers, one from each of 
the two distributions. He was told that 


the manner in which he was to 
trial, E set the value of the potentiometers and 
presented the reading on one of the meters. After 
S responded, E recorded the response and then pre- 
sented the feedback required by the experimental 
condition. On successive trials the value presented 
appeared alternately on Meters A and B so that S 
would have equal opportunities to sample both 
meters. Each presentation and subsequent recording 
of the response took approximately 10 sec. Each S 
was required to make 200 choices in a single session 
which lasted approximately 1 hr. After every 50 
trials, summary feedback was provided and a 2-min. 
rest period was introduced. 
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between the highest B value at which A was 
chosen and the lowest B value at which B was 
chosen; for the B distribution, it was midway 


Average Absolute Deviation 


Fro. 1. Average discrepancy between empirically 
determined and theoretically optimum cutoff points 
for three levels of input discriminability. 
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1134) = 14.00, p < .001. Also apparent from 
Figure 1 is the fact that as the two meter 
distributions become more discriminable, the 
amount of improvement in performance in- 
creases; that is, when the two distributions 
are widely separated, performance improves 
much more noticeably over blocks than when 
the two distributions are closely aligned. 
This significant interaction of trial blocks with 
discriminability, F(14/1134) = 5.85, p< 
001, warrants further comment. The per- 
formance of the low-discriminability group 
is initially quite good and shows very little 
improvement after the early trials, appearing 
to approach an asymptotic value of 2.0 de- 
viation units during the third block of trials. 
At this discriminability level the two dis- 
tributions overlap enough to permit S to 
operate at a near-optimal level by adopting 
a single cutoff for the two distributions. He 
appears to establish such a cutoff point very 
early and, having little to gain by adopting 
a two-cutoff strategy, stabilizes on this value. 
At the higher levels of discriminability, on 
the other hand, adoption of a single cutoff 
value must result in an appreciable reduction 
in score, and S wisely abandons this strategy 
in favor of the more rewarding double-cutoff 
approach. Obviously, however, a number of 
samples must be drawn before he is in a 
Position to establish separate A and B cutoffs; 
thus, it is not surprising that he initially per- 
forms rather poorly but shows rapid and 
marked improvement as he gains information 
upon which to base a discrimination. This 
general improvement over blocks and the 
obvious deviation, especially in the higher dis- 
criminability level groups, from a single cut- 
off strategy serve to reaffirm the Howell and 
Funaro (1965) conclusions. 

Figure 2 summarizes the influence of per- 
centage of feedback on deviation scores. The 
obvious triviality of this effect is supported 
by the analysis of variance which indicated 
nonsignificance (p > .05) for percentage of 
feedback (33%, 67%, and 100%) and all 
its interactions. The 0% feedback condition 
was not included in the analysis, but it is 
apparent from Figure 2 that the 0% condi- 
tion does not differ from any of the other 
conditions. It seems especially surprising that 
the 0% feedback group was able to approxi- 
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Blocks 


Fic. 2. Average discrepancy between empirically 
determined and theoretically optimum cutoff points 
for the four levels of percent feedback. 


mate an optimal cutoff strategy as well as the 
100% feedback group. It is important to 
note, however, that the 0% feedback group 
does receive information about each distribu- 
tion through the alternate presentation of 
readings on Dial A and Dial B on successive 
trials; this information, if used, is adequate to 
permit inference of an optimum strategy. 

The conclusion to which one is led by the 
percentage of KR results, then, is that, con- 
trary to the growing weight of evidence in 
support of a critical 20-33% KR level, KR 
can indeed be completely superfluous in 
maintaining performance. Ammons (1956) 
noted that in most tasks S can find some 
guideline to use in establishing a response 
criterion. If the stimulus situation actually 
contains enough information to enable him 
to perform optimally, the response criterion 
which he sets for himself may be equally as 
effective as one administered externally. 
Such, of course, was the nature of the present 
situation: information provided by the in- 
structions and the alternation procedure seems 
to have been utilized maximally without the 
aid of KR. 

Since the information available in KR 
seems generally to have been unnecessary for 
optimal task performance, it is not surprising 
that predicted interactions of specificity and 
difficulty with percentage of KR failed to 
materialize. This finding strengthens even 
further the conclusion that KR can be a 
very impotent directive factor; even when 
the information is highly specific and task 
difficulty is very high, it matters little how 
much feedback is provided. 


Errects or FEEDBACK 


A seemingly paradoxical finding arises, how- 

‘ ever, in connection with KR specificity. The 

* fact that KR is unnecessary, as shown by the 

percentage of KR data, suggests that S com- 

pletely ignores this source of information. 

Figure 3 indicates that this is definitely not 

the case. That he can and does use KR in- 

. formation to improve performance is indicated 

by the fact that specific feedback conditions 

were more effective than nonspecific feedback 

conditions in guiding him toward the optimal 

cutoffs. Moreover, Figure 3 indicates that 

the cutoffs established on Meter A were much 

nearer the optimal cutoff than were the cut- 

offs established by the same feedback condi- 

tion groups on Meter B (this would not occur 

if S were ignoring the information com- 

pletely). The analysis of variance supported 

X both of these apparent differences: both the 

feedback specificity effect, F(1/162) = 7.67, 

p< .01, and the meter effect, F(1/162) = 

11.51, p < .001, were highly significant. The 

latter finding seems to indicate that Ss were 

focusing more of their attention on the left 

dial than they were on the right dial for 
reasons that are still unexplained. 

It must be concluded, therefore, that al- 
though S will rely to a degree upon KR 
information when it is available, he will bene- 
fit from it only to the extent that it represents 
some critical level of specificity. In the ab- 
sence of KR, he seeks other sources of guid- 
ance which, it would seem, may be equally 
as useful in directing behavior. This conclu- 
sion supports the notion of a critical region of 
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specificity for KR (eg, Ammons, 1956; 
Annett, 1961), and also demonstrates man’s 
resourcefulness in gathering information upon 
which to base future responses. 

The latter point is of particular significance 
for real-life situations in which KR is either 
difficult or expensive to obtain; in such situa- 
tions it might be well to review the entire 
informational picture and to consider alterna- 
tive ways of distributing information-gather- 
ing resources, Such an analysis might well 
suggest a deemphasis of KR relative to other 
sources of information. 

A final point should be made regarding 
the implications of these data for real-life 
situations. While the main emphasis of this 
report is upon KR as information, it might 
be argued that KR is valuable from a motiva- 
tional standpoint as well. If true, one would 
expect that percentage of KR would have 
some influence upon level of performance; 
in the present task, at least, such was not 
the case. Perhaps if greater value were at- 
tached to correct responses, a percentage of 
KR effect might emerge, serving as evidence 
for a motivational influence. A preliminary 
test of this hypothesis, using monetary pay- 
offs as incentive, also failed to produce such 
an effect. 

In summary, it appears that some tradi- 
tional concepts regarding the role of feedback 
in establishing and maintaining performance 
may require revision. In the present investi- 
gation, Ss without extrinsic feedback main- 
tained a performance level similar to Ss re- 
ceiving continuous feedback. Exactly what the 
groups with 0% feedback learned which en- 
abled them to perform as well as a 100% 
feedback group remains unanswered despite 
the hypothesis advanced concerning the alter- 
nation procedure. Conceivably, the instruc- 
tions which Ss received supplied sufficient 
information to make the addition of extrinsic 
feedback completely unnecessary. 
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RELATIONSHIP BETWEEN PERCEPTUAL STYLE AND 


DRIVER REACTION TO AN EMERGENCY 
SITUATION ' 


GERALD V. BARRETT axp CARL L. THORNTON? 
Goodyear Aerospace Corporation, Akron, Ohio 


Previous investigators have found low relationships between human char- 
acteristics and accident behavior, The possibility was raised that these findings 
might be due to a number of factors including indiscriminate grouping of 
accidents and choice of predictors which have no logical relationship to 
accident behavior. In the present study 20 drivers were subjected to a con- 
trolled emergency situation in an unprogrammed automobile simulator, where 
a pedestrian (dummy) emerged from a shed into the path of the vehicle. 
Subsequent measurements of individual perceptual style, using the Rod and 
Frame Test (RFT), were correlated with effectiveness of reaction to the 
emergency situation. Series 3 of the RFT correlated .67 with reaction time, 
75 with deceleration rate, and 50 with a hit-miss criterion. Individual differ- 
ences in perceptual style and simulator discomfort resulted in shrunken 
multiple correlations of 89-95 depending on the data transformations. The 
extrapolation of the results to real-world accident behavior has implications 


especially with regard to age, sex, and use of alcohol by drivers. 


In general, the obtained relationships be- 
tween various measures of driver character- 
istics and accidents have been quite low. 
Goldstein (1961), in an extensive review 
of research on human variables assumed to 
be important in safe motor-vehicle operations, 
found low relationships between various pre- 
dictors and accident behavior. He noted, for 
example, in over 45 studies that the correla- 
tions between criterion measures and visual 
functioning (visual acuity, depth perception, 
eye-muscle balance, etc.) were never above 
20. Similarly, correlations were no higher 
than .17 between reaction time (RT) and ac- 
cident criteria in four studies using large 
samples (331-3,000 drivers). The use of 
psychomotor apparatus, paper-and-pencil psy- 
chomotor tests, sensory-perceptual tests, and 
cognitive measures shows correspondingly low 
correlation with accident criteria (Goldstein, 
1962). j 

Two points should be noted concerning 
studies attempting to validate predictors of 

1 The emergency-behavior data used in this study 
were collected under Public Health Service Con- 
tract No, PH-108-64-168 while the collection of 
Perceptual style data was supported by Goodyear 
Aerospace Corporation. 

2The authors express appreciation to Margaret 
J. Smith for statistical advice and to Patrick A. Cabe 
for computer analysis of the data. 
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accident criteria. First, the investigators have 
used very heterogeneous measures of driver 
accident behavior. There appears to have been 
little attempt to categorize or logically group 
accidents. A tacit assumption seems to be 
that all accidents have a common cause and 
therefore can be grouped together. In op- 
position to this point of view, one could 
reason that accident behavior is quite diverse 
and complex, having many causes, and only 
by an appropriate categorization of accident 
behavior would it be possible to obtain a 
better understanding of the phenomenon. 
Second, the predictors chosen often fail to 
have any theoretical or logical relationship 
to the accident behavior which is to be pre- 
dicted. The failure to categorize accidents 
logically and/or to choose a predictor which 
has a theoretical relationship to the category 
of accidents may account, to a large extent, 
for the low correlations which have been re- 
ported. 

An attempt to eliminate the category prob- 
lem was made by Barrett and Nelson (1965b, 
1966b). They tested driver reaction to what 
could be considered one category of accidents, 
a situation where the driver was forced to 
make an emergency stop because a pedestrian 
emerged suddenly into the vehicle’s path. 
This emergency situation was staged while 
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TABLE 1 


INTERCORRELATIONS OF EMERGENCY BEHAVIOR with RFT MEASURES 


r Log c Deceleration Log 
Reaction reaction Deceleration rate deceleration Hit-miss 
a time ze squared rate 
Series 1 .24 .21 —.13 —.09 —.17 —.01 
Series 2 37 35 —.36 —.32 — Al 16 
Series 3 161 E ine —.55** —.53* —.57** 34 
Orientation index 34 30 —.39 —.35 —.42 19 
Log Series 3 Ofer ASS pd —.69*** —.69*** —.70*** .45* 
Reciprocal Series 3 —.63** —.60** ae on 74 —.50* 
7 <.05, 
=p <o 
* <.001 
RESULTS emphasizes the importance of the interval 


The results of the perception test (RFT) 
were compared to (1) initial brake reaction 
time, that is, the time which elapsed from the 
emergence of the dummy until S began to 
depress the brake pedal; (2) deceleration 
rate, that is, the miles per hour per second 
which the apparent motion of the simulator 
was decreased by S as measured from the 
time of emergence of the dummy to either 
subsequent impact with the dummy, or the 
obtaining of zero velocity before collision; 
and (3) the hit-miss dichotomy found in 
most real-world studies. Series 1, 2, 3, and 
the Orientation Index from the RFT were 
compared with these three criteria. 

The results confirmed the hypothesis that 
perceptual style was significantly related to 
the ability to react to emergency situations. 
However, only one aspect of the RFT cor- 
related with the criteria. 

Table 1 shows that correlations with Series 
1, 2, and Orientation Index were not signifi- 
cantly related to emergency behavior. Differ- 
ences between these aspects of the test and 
Series 3 will be discussed later in connection 
with Witkin et al.’s (1954) findings. 

The correlation (Pearson r) between Series 
3 of the perception test and reaction time in 
the simulator was .61 (p < .01). The decelera- 
tion rate (r = —.55, p < .01) was also sig- 
nificantly related to Series 3. However, the 
relationship between the hit-miss dichotomy 
and Series 3 was only .34 (nonsignificant) 
using a point-biserial r. This latter finding 


measurement used in this study. 

Inspection of the data revealed that the 
relationships were not linear. It became evident 
that the empirical data were more nearly 
approximated by a log function for reaction 
time (see Figure 2) and a hyperbolic function 
for deceleration rate (see Figure 3) and the 
hit-miss dichotomy. Correlations increased 
in magnitude from .61 to .67 for reaction 
time, from .55 to .75 (p < .001) for decelera- 
tion rate, and from .34 to .50 (% .05) 
for the hit-miss dichotomy. A single per- 
ceptual measure, then, was able to account 
for 25-56% of the individual differences in 
ability to react to the emergency situation. 

It was apparent from the data analysis 
that discomfort, due to the experience of 
operating the simulator, affected the results. 
Multiple regressions were calculated using a 
Wherry Test Selection Program with F test 
(Wherry & Smith, 1966) for the IBM 1620 
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computer. It was found that discomfort was 
not a significant factor in initial reaction time, 
but was quite important for deceleration rate. 
Multiple Rs obtained ranged from .89 to .95 
($ < .001) depending on the particular data 
transformations involved. Table 2 shows the 
1 
multiple Rs obtained using values of x, x°, x, 
and log x as units representing various curv- 
ilinear functions that could yield a data fit. 
Two factors, perceptual differences and simu- 
lator discomfort, accounted for 79-90% of 
the variance in deceleration rate of the 
simulator. 
As mentioned previously, high correlations 
were found between emergency behavior and 


TABLE 2 


VARIABLES APPEARING IN THE MULTIPLE-REGRESSION 
Equation Usmo Varus Dara TRANSFORMA- 
TIONS—DECELERATION RATE AS RELATED TO 
PERCEPTUAL STYLE AND DISCOMFORT 


ia bl. 
Vem Multiple 
Item R 
1 2 3 4 5 
— —— — EE 
Deceleration | 1 * 
rate (DR) F, P 08 . 3 
1 1 
DR)? = — .89* 
¢ ) A log Ss 1 775 
1 1 1 1 
— ante 2 ae if my SS 95* 
DR) Sa Fc 
log (DR) = De | log Si .90* 
3 


Note.—Si = Series 1 of the Rod and Frame Test (RFT); 
S? = Series 2 of the RFT; Ss = Series 3 of the RFT; D = rating 
of subjective discomfort during simulator operation; Za = esti- 
tate of the length of time the discomfort lasted after leaving 
the simulator, 
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Series 3 only. Previous research (Witkin 
et al., 1954) showed that the RFT was not a 
unitary test. Series 1 and 2 were found to 
be similar, while Series 3 (no body tilt) 
tapped a different dimension of perception. 
Embedded Figures Test (pulling an 
item from an embedded context), often used 
as a measure of perceptual style, was found 
to be most highly related to Series 3 and 
not to Series 1 and 2. Witkin's perception 
tests were found to be on a difficulty con- 
tinuum. The Embedded Figures Test and 
Series 3 were the easiest of the perceptually 
embedding tests. Since at the time of the 
experiment it was not known how the driving 
task would compare to the difficulty levels of 
perceptual style, all three series of the RFT 
were given. 


Discussion 


The present results support the hypothesis 
that field-independent individuals are more 
effective in responding to emergency situa- 
tions. The finding of high relationships, ac- 
counting for approximately 80% of the vari- 
ance, was probably a function of a number of 
interrelated factors. 

Five such factors were considered to be 
important in obtaining significant results in 
this type of accident research: the studying 
of only one specific type of accident, using an 
improved research tool to study a specific 
behavioral sequence, unitary exposure to the 
emergency sequence, the continuous measure- 
ment of relevant criteria, and the utilization 
of a predictor with a logical relationship to 
the criterion. 

The importance of studying one type of 
accident is shown by noting that such ac- 
cidents as rear-end collisions, loss of control 
on turns, accidents during rain or snow, and 
accidents related to mechanical malfunction 
are usually grouped together. They may have 
completely unrelated human characteristics 
associated with incidence and severity of 
outcome. Also, the classification of real-world 
accidents often does not take into account 
the problem of human culpability. The tacit 
assumption is that an accident is the driver’s 
fault. The term “accident” is often a legal 
designation rather than a description of a 
behavioral act. It is quite possible that 
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having an accident by the legal definition 
is the most effective driving behavior (eg. 
hitting a parked car to avoid a child). 

Significant findings have often been the 
result of improved research tools. Looking at 
a specific structure rather than just a global 
process has made it possible to confirm or 
refute many theoretical issues. By using an 
automobile simulator, control and continuous 
measurement of pertinent variables in an 
emergency situation were possible. Also, the 
simulated situation presented every driver 
with the same emergency, giving constant 
exposure across Ss. This is not to imply that 
the simulator used in the present investigation 
is not without flaws, especially as evidenced 
by the discomfort experienced by the majority 
of Ss. 

The predictor in this study had a logical 
relationship to the criterion behavior. Often 
simple brake reaction to a red light is cor- 
related with accident records (e. g., Cation, 
Mount, & Brenner, 1951) where there is 
little correspondence between the braking 
response elicited by the red light and the 
complex perceptual process in the real world. 


Related Real-World Research 


In a review of the literature, one real-world 
study was found which appeared to be re- 
lated to the behavioral sequence analyzed in 
this experiment. An attitude scale, picture- 
frustration test, Raven Progressive Matrices, 
and a visual perception test were administered 
to accident repeaters and accident free drivers 
(Spicer, 1964). Only the visual perception 
test, motion picture films representing a wide 
variety of traffic conditions, was consistently 
related to driving and traffic mishaps. The 
Ss checked statements concerning each of 11 
film sequences to indicate what he felt was 
important to the driver of the car. This test 
might have some relationship to the concept 
of perceptual style since S was required to 
extract the salient information from the film 
strip in a short period of time (35 sec.). 

It was interesting to note that adolescents 
did not perform as well as older drivers on 
the visual perception test. This could have 
been a function of either inexperience and 
therefore lack of awareness of vital cues in 
driving or field dependence and consequent 
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lower ability to pull a figure from an em- 
bedding context. This point will be discussed 
later in more detail. 


Alternate Theoretical Explanations oj Results 


A parsimonious explanation of the results 
would include a description of perceptual 
abilities, although perceptual style is also 
related to cognitive style (Witkin et al, 
1962). Field-independent individuals tend to 
think more analytically and critically. Pos- 
sibly, field-independent individuals reacted 
faster because they already had a preformed 
set to respond, thinking out in advance how 
they would react to an emergency situation. 
In contrast, the field-dependent person, being 
less analytical, may never have given any 
consideration to how he would react. 

Witkin et al. (1962) have reported that 
field-dependent individuals become confused, 
disorganized, and inadequate under stress. 
Since confusion could also be a factor in 
emergency situations, there apparently are a 
number of possible alternative explanations 
using other correlates of perceptual style 
which could account for the less effective be- 
havior of the field-dependent Ss. 


Extrapolation oj Results to Real-World Ac- 
cident Situations 


Three complex real-world driving behaviors 
can be explained on the basis of the results of 
this investigation and other related perceptual 
style research. First, it has been well estab- 
lished that alcohol is a factor in many at- 
cidents (Fox & Fox, 1963). Witkin et al. 
(1962) report that perceptual style, as meas- 
ured specifically by S’s ability to visually 
extract a figure from an embedding context, 
is affected by alcohol. It is conceivable there- 
fore that while under the influence of alcohol 
a driver would find it harder to extract an 
emerging pedestrian, vehicle, or other obstacle 
from the complex visual field. This could be 
one factor in explaining why drivers under 
the influence of alcohol are involved in a 
high percentage of accidents. 

Second, studies are consistent (Goldstein, 
1962) in showing that young drivers (below 
25) and older drivers (beyond 65) have a 
disproportionately high rate of accidents. It 
has also been shown that perceptual style is 
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a function of age. In a number of cross- 
sectional and longitudinal studies (Witkin 
et al., 1962) it has been determined that 
individuals become more field independent 
as they grow older, with an asymptote around 
21. Since young drivers (16-21) are relatively 
field dependent, this might be a factor in 
their higher accident involvement. While the 
younger drivers as a group have a faster 
simple reaction time, the real-world situation 
involves a complex perceptual element and 
the intruding object must first be recognized 
before an individual can respond to it. 
Similarly, recent evidence (Schwartz & Karp, 
1967) indicates that older individuals are 
field dependent. 

Third, previous research (Witkin et al., 1962) 
has shown that women as a group are more 
field dependent than men. On that basis one 
would predict that women drivers who are 
presented with an emergency driving situa- 
tion would be less effective in responding than 
men. One real-world study has a possible 
bearing on this point. Uhr (1959) found that 
women were more likely to cut across a motor 
Scooter’s path from a stop street than were 
men. It is possible that the women were un- 
able to perceptually extract the small 
scooter from the background as quickly as 
the men. 

It appears feasible to extrapolate the 
results of the present study to several other 
real-world situations. Any occasion where an 


individual must quickly detect, identify, and 


properly react to a stimulus could be related 
to perceptual style. Many examples could 
be given such as sudden emergence of a child 
from between parked cars, the swerving of an 
automobile into the driver's projected path, 
or the emergence of an automobile from an 
alley or hidden driveway. The generality of 
these results, however, must await future 
research. 

Past research with the Embedded Figures 
Test (EFT) and the RFT may also be 
brought to bear on practical traffic problems. 
It is known that various factors decrease 
one’s ability to pull an embedded item from its 
context, such as alcohol, fatigue, and brain 


damage. Besides these factors, which may 
cause severe decrement in performance for 


anyone, there are subtle genetic and environ- 


mental factors which came a wide range of 
individual differences even under optimal 
testing (oF driving) conditions. For this tense 


Since it Is not now feasible to restrict 


field-dependent people from driving, training 
could be established for those who find 


. Perception, 
per se, is resistant to training. It is, however, 
possible to “train around” certain deficits. 
That is, it is possible to identify perceptual 
people to compensate for 


as they detect the emerging image. In this 
manner, the lag time between detection and 
identification may be utilized in decelerating 
the vehicle. This amounts to telling the 
driver to “be careful,” which many slogans 
do, but in a very specific way. 

In conclusion, this discussion illustrates one 


Further research, of course, is required to 
confirm the hypothesized functional relation- 
ships between perceptual style and human 
performance in specified emergency situa- 
tions. Also, a replication of this study in the 
real world should be conducted, especially in 
light of the restriction in sample imposed 
by the “simulator sickness.” There is now, 
however, some empirical evidence which 
gives a basis for a number of specific in- 
vestigations. 
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OF BASIC MAINTENANCE TASKS 


RICHARD J. SHAVELSON asn JOSEPH L. SEMINARA? 
Lockheed Missiles and Space Company, Sunnyvale, California 


9 Ss were trained extensively on 3 maintenance tasks: bolt torquing, con- 
nector mating, and nut threading. They were randomly distributed into 1 of 
3 clothing conditions (shirt-sleeve, vented suit, and pressurized suit) and 
trained and tested on all 3 tasks in 3 gravity conditions (1 gravity, 1 gravity 
harnessed, and % gravity). This study demonstrated that lunar gravity 
imposed a 25% performance decrement over performance in 1 gravity (p < 01), 
The vented suit imposed a 60% performance decrement (p < .01), and the 
pressurized suit imposed a 150% performance decrement (p < .01) when 
compared to performance in the shirt-sleeve mode. On the basis of these 
findings and subjective reports, preliminary human-factors design criteria 
were suggested for lunar-gravity performance aids. The need for subsequent 
research in the areas of mission-specific maintenance tasks and candidate 
job aids to improve performance in the lunar environment was pointed out. 


applied psychology 


June 1968 


EFFECT OF LUNAR GRAVITY ON MAN’S PERFORMANCE 


Past research, employing a variety of simu- 
lation methods, demonstrated that man’s be- 
havior in lunar gravity differed considerably 
from his behavior in the earth’s gravity. 
Hewes? and Spady (1964a, 1964b) found 
that man could jump vertically as high as 14 
ft. in lunar gravity and perform other ordi- 
nary tasks in an extraordinary manner. 
Roberts (1963) investigated man’s walking 
behavior in lunar gravity which he described 
as a fast walk in slow motion. Prescott and 
Wortzs (1966) and Hazard (1965) showed 
that, during walking, man’s metabolic rate 
decreased in 6 gravity when compared 
to his 1-gravity rate. Although there has 
been a comprehensive examination of man’s 
physical and physiological behaviors in lunar 


The authors wish to acknowledge the assistance 
of G. F. O'Keefe, C. J. Baker, and A. L. Hall in 
Providing life support, communications, and technical 
advice for pressure suit research, and R. E. Mahan, 
simulation engineer, 8 

D. E. Hewes, A. A. Spady, Jr, & R. L. Harris. 
Comparative Measurements of Man’s Walking and 
Running Gaits in Earth and Simulated Lunar 
Gravity. NASA Tech. Note D-3363. Unavailable. 

3E. C. Wortz & E. J. Prescott. The Effects of 
Subgravity Traction Simulation of the Energy Costs 
of Walking. Personal communication, 1965. Pp. 2-26. 


gravity, a few studies have concerned them- 
selves with man’s performance of maintenance 
tasks. Holmes (1965) studied the effectiveness 
of a variety of tools on maintenance tasks in 
various gravity conditions including lunar 
gravity. Using two subjects (Ss) for his 
lunar-gravity simulation, he found that per- 
formance times increased by 8% over per- 
formance in the earth’s gravity. However, due 
to the paucity of Ss, the large number of 
tasks under examination, and the lack of 
statistical treatment, Holmes’ findings were 
considered tentative, at best. 

The purpose of the present research was 
to develop data regarding the effects of lunar 
gravity and candidate lunar clothing con- 
ditions (shirt-sleeves, vented suit, pressurized 
suit) on man’s performance of three basic 
maintenance tasks (bolt torquing, connector 
mating, and nut threading) to serve as guide- 
lines for task analyses, timelines, and per- 
formance-aid designs. It was hypothesized 
that performance times in % gravity would 
increase over those in 1 gravity, that per- 
formance times would increase with increas- 
ingly encumbering clothing requirements, and 
that the connector-mating task would be the 
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least affected by the gravity or clothing- 


condition variables, 


Muro 

Subjects 

Nine male Ss participated in this study. The Ss 
were selected from a pool of “astronaut” test Ss 
maintained by Lockheed Missiles and Space Com- 
pany's (LMSC) Biotechnology Organization. The 
Ss fell between the ages of 25 and 36, all were engi- 
neers or physical scientists, all were under 6 ft. in 
height, and all had performed as a pressure-suiled 
test S$ in previous simulation research. 


Apparatus 


The effect of lunar gravity was achieved by 
using LMSC’s -gravity simulator, LUNARG. This 
unit, which supports five-sixths of the S's weight, 
consisted of nine negator spring motors attached to 
a cable reeling system, The S was attached to the 
aforementioned cable via a harness. Fine balancing 
of plus or minus 1 Ib. accuracy was achieved by 
placing a skin diver's weight belt around S's waist. 

The space suit employed in this study was an 
Apollo A-4H training suit, on loan from NASA, 
Houston. The suit consisted of a Model C3 helmet, 
a torso-limb assembly, gloves, and astro-cap. 

Since the Apollo A-4H pressure suit had a helmet 
without a visor, the vented-suit condition was de- 
fined as helmet and gloves on with an air ventiliza- 
tion flow of 10 cu. ft. per minute, The pressurized- 
suit condition was defined as suit pressurization 
at 3.7 +2 psi differential with air, The clothing 
condition referred to as shirt-sleeve was defined 
as street clothing: shoes, socks, slacks, and shirt. 

The performance-fixture apparatus was a unistrut 
frame structure 9% ft. in height and 19 in. in width 
on which a uniform-sized panel, 54 in. in height 
X 19 in. in width X % in. in depth, was placed. 
The panel contained three tasks: (1) bolt torquing, 
(2) connector mating, and (3) nut threading. On 
S's left-hand side of the panel were nine threaded 
receptacles into which bolts 1 in. long with a 746 
of an inch hexagonal head were screwed and then 
torqued down to 70 in. Ib. using a torque wrench. 
Nine identical Deutsch push-pull type connectors 
C4 in. in diameter) were located in the middle of 
the panel. Twelve spaced bolts of three 


different sizes (N. %e, % in.) protruded from the 
right-hand side of the panel. Two of the nuts were 
hexagonal and fitted on the two larger bolts. The 


third was a small wing nut. Attached to the per- 
formance fixture at a height of 43 in., and extend- 
ing out to the right of the panel, was a contents 
tray (24 X 614 X 60ʃ in.). During the experiment, this 
tray contained the torquing wrench and a predeter- 
mined number of each of the task elements. 


Design 


The experimental design employed in this research 
was a partial replication of an 81-celled 3X 3X3 
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factorial. Nine Ss were randomly assigned to one 
of three clothing conditions (shirt-sleeve, vented 
sult, or pressurized suit) with three Ss in cach 
condition. All nine Ss, regardless of clothing com- 
dition, performed all three subtasks (torquing, con- 
nector mating, and threading) in all three gravity 
conditions (1 gravity, 1 gravity harnessed, and 
% gravity). In order to minimize the effects of 
practice, Ss were tested in the principal condition, 
gravity, in a counterbalanced order 


All Ss received task orientation and preliminary 
training in the shirt-sleeve, 1-gravity environment. 
They were then trained and tested in their 


respective clothing conditions. 


Procedures 


The experiment was divided into two parts: (1) 
orientation and preliminary training, and (2) 
training and testing. The former part 
describes the instructions given to the Ss and the 
base-line training in the shirt-sleeve 1-gravity 
environment. The latter part describes training in 
specific clothing and gravity conditions and testing. 
Orientation and preliminary training. The Ss were 
instructed to perform each task separately, always 
working from panel top to bottom. They were told 
to work as quickly and as accurately as possible 
since both time and errors were being recorded. 
One exception to this procedure was necessary. 
During the course of the experiment, it was dis- 
covered that Ss participating in the pressurized-suit 
condition were unable to successfully complete all 
of the task elements on the performance panel. This 
was not due to the inability of Ss to reach all of 
the task elements, since for all Ss regardless of 
clothing condition the top of the panel was placed 
1 in. above their shirt-sleeve height, but rather 
to the amount of fatigue and the difficulty of manip- 
ulation in the extreme high and low positions. For 
example, although Ss were able to place the nine 
bolts into the panel, they were unable to torque the 
bolts in the highest and lowest positions with the 
torque wrench to the preset value of 70 in. Ib. As 
a consequence, the work area of the panel was re- 
duced such that the pressure-suited Ss were required 
to complete the middle five torquing and connector- 
mating task elements, and the middle eight threading 
task elements. Average time to complete one task 
element was computed for all Ss on all tasks. 

At the completion of the instructions, Ss were 
given an opportunity to manipulate the three tasks 
until they were satisfied that they had fully under- 
stood the instructions. The Ss were then trained in 
the shirt-sleeve 1-gravity mode until an asymptote 
in time and errors was reached. This required about 
one-half to 1 hr.’s time. 

Specific training and testing. Specific training and 
testing proceeded first with the shirt-sleeve group» 
then with the vented-suit group, and lastly with 
the pressurized-suit group. In all three clothing con- 
ditions, Ss were trained to asymptote in time and 
errors in a particular gravity condition, and then 
tested immediately afterward. At the completion of 
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fe aymplote and then tested. This procedure 
carried out for all three gravity conditions 

fo perform cach task, time between 
errors were recorded during each test. 
tion of the testing, Ss were asked to fill out question- 
naires relating to performance in clothing and gravity 


Resutts 


The results reported in this section are 
divided into three parts: base-line training, 
Specific training and test, and questionnaire 
responses. 


Base - Line Training 


The mean number of shirt-sleeve, I-gravity 
training trials to performance asymptote was 
8.89 with a standard deviation of 2,31 trials. 
The mean time to perform one torquing task 
element at asymptote was 11.37 sec. with a 
Standard deviation of .81 sec. The mean time 
to perform one connector-mating task ele- 
Ment at asymptote was 3.12 sec. with a 
Standard deviation of .25 sec. The mean time 
to perform one threading task element at 
asymptote was 8.53 sec. with a standard 
deviation of .60 sec. It is important to remem- 
ber that the performance times reported refer 
to mean time to perform one task element, for 
example, the mean time to torque one bolt, 


instead of the total time to perform all nine 


bolt torques. A total of one error was com- 
mitted by the nine Ss at base-line training 
asymptote. 


Specific Training and Test 


All nine Ss received base-line training in 
the shirt-sleeve, 1-gravity mode. At the com- 
Pletion of this training, Ss received specific 
training on all tasks in a particular clothing 
Condition and in a given gravity condition. 
Since the gravity conditions were counter- 
balanced, individual Ss required different 
numbers of training trials for different cloth- 
ing conditions and different gravity sequences. 
The latter depended on the amount of transfer 
of training from base-line training. For ex- 
ample, training and test in 1 gravity had com- 
plete transfer to the 1-gravity-harnessed 


condition while transfer frees 1 gravity to 0 
gravity showed moch hess transfer, depending 
On preceding specific training 
sequences. 

The results of this experiment 
marized in Table 1 and Figure 
formance times. In general, there 
nificance in performance times bet 
three clothing conditions (p < 01). 
analysis of this finding revealed that 
ance In the vented suit required more 
than in the shirt-sleeve mode (p < 00). 
formance in the pressurized suit 
nificantly more time than the 
(P< 01) and, therefore, 
cantly more time than the shirt-sleeve 
dition, In terms of percentage of time 
the vented suit showed a 63.61% 
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in the 1-gravity- 
harnessed condition (# < 01), and signifi- 
cantly longer than in the 1-gravity condition. 
With respect to percentage of increase in time, 
the 1-gravity-harnessed condition showed a 


The difference in performance times for 
each of the three tasks was significant beyond 
the .01 level of confidence. Using a 1-degree- 
of-freedom test, it was demonstrated that the 
threading task took significantly longer than 
the connector-mating task (p < .01). Further, 
it was demonstrated that the torquing task 
took significantly more time to perform than 
the threading task (p < .01). In terms of 

tage of time increase, the threading 
1 a 192.24% increase over the 
connector-mating task. The torquing task 
showed a 267.18% increase over the con- 
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Fic. 1. Performance times by task for all gravity and clothing conditions. 


nector-mating and a 25.64% increase over 
the threading task. 

The interaction between clothing and 
gravity was not significant. This was inter- 
preted to mean that for every level of the 
gravity condition, performance in the clothing 
condition remained relatively constant. In 
short, the 6-gravity condition required 
greater time to perform each of the tasks 
regardless of clothing condition, and regard- 
less of clothing condition, 1 gravity and 1 
gravity harnessed required about the same 
time to perform. 

The interaction between clothing condi- 
tion and task was significant ( < .01). This 
was interpreted to mean that the difficulty 


of the tasks did not remain constant for every 
level of clothing condition. 

Specifically, the threading task when in the 
vented suit showed less of an increase in 
performance time over the shirt-sleeve condi- 
tion than did the torquing task in the same 
clothing mode (56.51% versus 76.68%). 
However, a comparison of performance times 
in the pressurized-suit mode revealed that 
the tasks were approximately equivalent for 
this condition (130.55% versus 138.48%). 
Further, the connector-mating task showed 
much less of a performance-time increase in 
the pressure-suit mode (87.53%) than did 
either the threading (130.55%) or the torqu- 
ing (138.48%) tasks. 
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TABLE 1 
STATISTICAL ANALYSIS OF PERFORMANCE Times 


Source SS d| MS F 
Clothing (A) 1,349.31} 2| 674.65) 26,92°* 
Subjects within 

groups (B) 150.34 6 25.06 
Gravity (C) 128.88 2 64.42 26.40" 
Task (D) 2,630.99 | 2 | 1,315.50 839.14 
AXC 16.64| 4 4.16 1.70 
Ax D 349.69| 4 87.42 | 35.83°* 
cxD 30.211 4 7.55 3.09" 
BNC 41.28 12 3.44 1.41 
BXD 100.39 | 12 8.36 | 343% 
AXCXD 19.72| 8 2.46| 1.01 
BXCXD 58.57 | 24 2.44 

*p <.05 

* 2.01 


The interaction between gravity and task 
was significant (p< .05). Again, the diffi- 
culty of the tasks did not remain constant 
for all gravity conditions. The finding is sup- 
ported by the fact that the threading task 
in 1% gravity took 35% more time to perform 
than in 1 gravity. 

The interaction between Ss within groups 
and gravity condition was not significant. 
All Ss experienced a decreased efficiency in % 
gravity while performing approximately the 
same in either 1 gravity or 1 gravity har- 
nessed. However, the interaction between Ss 
and task was significant (p < .01). Different 
tasks were of varying difficulty for each S 
within a given clothing condition. Finally, a 
comparison of the intertask intervals re- 
vealed no differences between task, clothing, 
or gravity conditions. The mean interval 
was 15 sec. 

The term “error,” as employed in this re- 
search, referred to the frequency of occurrence 
of dropping any task element, or holding on 
to any part of the performance panel or 
structure framework while performing any of 
the tasks. There was no attempt made to 
assign relative weights according to the nature 
of the error. The Friedman two-way analysis 
of variance by ranks revealed that perform- 
ance as measured by errors did not differ 
significantly for any of the three main vari- 
ables, and there were no significant inter- 
actions. These findings were due, in part, to 
the loose definition of “error.” 
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Questionnaire Responses 


Subjective impressions of test Ss regarding 
the task under various gravity conditions 
were examined for conformity with the ob- 
jective data reported above. All nine Ss 
agreed that the harness, per se, made no dis- 
cernible difference. Therefore, the primary 
differences of concern were between 1 gravity 
and % gravity for the various suited con- 
ditions. 

In this comparison, whenever a number is 
reported in quotes, it refers to the scale 
number on a scale from “1” (easy) to “5” 
(very difficult), and is a mean representation 
of the three responses. 

The Ss were asked to rate the entire main- 
tenance task from easy to very difficult. The 
shirt-sleeve Ss rated the task as easy in both 
1 and 16 gravities, “1” and “1,67,” respec- 
tively. The vented-suit Ss found the task 
moderately difficult in both gravities, “3” 
and “3.67.” The pressure-suit Ss rated task 
difficulty as “3” for both gravity conditions. 

When asked to identify the most difficult 
single task, the connector-mating task was 
never indicated. All three shirt-sleeve Ss con- 
sidered the threading task most difficult in 1 
gravity, and considered the torquing task 
most difficult in % gravity. Two vented-suit 
Ss considered the threading task to be the 
most difficult in 1 gravity, and all three 
identified the torquing task as the most 
difficult when in % gravity. Torquing and 
threading were identified as equally difficult 
by the pressure-suited Ss in both gravities. 

The Ss did, in no instance, identify the 
middle portion of the performance panel as 
difficult to work on. Two shirt-sleeve Ss 
identified the lower portion as the most diffi- 
cult to work on in both gravities. The higher 
portion of the panel was identified as the 
most difficult by all three vented-suit Ss. 
However, two pressure-suit Ss described the 
lower portion of the panel as most difficult to 
work on. In short, it appears that extreme 
positions, high or low, caused the greatest 
performance difficulty relative to mainte- 
nance-task operations. 

The shirt-sleeve Ss did not consider the 
maintenance task fatiguing in either 1 or \% 
gravity. Their ratings were “1” and “1.67,” 
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respectively. The vented-suit Ss found the 
task moderately fatiguing, “3,” for both 
gravities. The pressure-suited Ss rated the 
task as “3” in 1 gravity and “4” in % 
gravity. The indication here is a trend in 
clothing condition going from the shirt-sleeve 
to pressure-suit mode. It is reasonable to 
suggest that, if the pressure-suit Ss were 
required to finish all task elements, they 
would have reported extreme fatigue. 

The Ss in all three clothing conditions 
unanimously agreed that they felt they had 
committed the most errors on the threading 
task, irrespective of gravity condition. They 
also unanimously agreed that they had to 
change their method of performance of the 
tasks when in % gravity as compared to per- 
formance in 1 gravity. 

The vented-suit Ss reported that perform- 
ance dexterity was moderately reduced, “3,” 
when compared to their shirt-sleeve base-line 
training. The pressure-suit Ss rated suit en- 
cumbrance as “5,” or extremely restrictive. 
Finally, no pressure-suit Ss reported anxiety 
in working in the pressurized mode. 


Discussion 


The results of this study supported the 
hypothesis that a maintenance-task decre- 
ment would be manifested in lunar gravity 
when compared with performance in the 
earth’s gravity. The finding of no statistical 
difference between the 1-gravity and 1-grav- 
ity-harnessed conditions indicated that the 
performance decrement in lunar gravity was 
not due to the requisite harness. The Ss’ 
responses to the questionnaire also supported 
these findings: “In lunar gravity, I stood on 
tiptoes. More difficult to aim screws or nuts 
due to less stability. Torquing produced 
counter torques pushing me away from the 
panel.“ 

Perhaps the most significant finding of 
this research was that Ss performing in the 
pressurized-suit condition were unable to 
complete all of the tasks. Further, the results 
of an analysis of the performance times sup- 
ported the hypothesis that the shirt-sleeve 
clothing condition would impose the least 
restriction on performance while the pres- 
surized suit would impose the greatest per- 
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formance restriction. These findings con- 
curred with Holmes’ (1965) description of 
the pressurized-suit mode as being the single 
most important variable hindering mainte- 
nance tool performance. 

The results of this research also supported 
the hypothesis that the connector-mating task 
would take less time to perform, while the 
torquing task would require the greatest 
amount of time due to counter-torques. The 
hypothesis that the threading task would 
cause the greatest number of errors was also 
supported. The interactions involving task 
indicated that maintenance tasks may be 
affected differentially by the person perform- 
ing them, the clothing condition, and the 
gravity condition in which they are performed. 


Task Analysis and Timeline Studies 


During successive stages of design and 
development of a complex system, information 
regarding the nature of the human interface 
with the system in terms of crew functions 
and tasks is necessary in order to insure that 
the system does not impose performance re- 
quirements beyond man’s capabilities. One 
purpose of this research was to derive extra- 
polation figures for maintenance-type tasks 
so that performance-time estimates could be 
made more accurately for lunar maintenance 
timeline studies. 

The performance data, summarized in 
Figure 1, provide an empirical basis for as- 
signing performance times to related lunar 
mission activities. For example, in estimating 
performance times for lunar intravehicular 
tasks in shirt-sleeves, 25.67% increment in 
performance time should be made over com- 
parable earth-based performance. Such gen- 
eralization from the data obtained in this 
study would provide a more valid basis for 
performance-time allocations than the “arm- 
chair analysis” technique so often resorted to. 


Performance Aids 


The findings of this research, that man’s 
behavior changes under the influence of lunar 
gravity, concur with the lunar-gravity re- 
search literature. The performance decrement 
observed in lunar gravity and in the various 
clothing conditions points to the necessity of 
considering human engineering design counter- 
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measures for coping with the peculiarities of 
the partial gravity lunar environment. While 
considerable attention is currently being given 
to performance aids for assisting the astronaut 
in the zero-gravity environment (Pierson & 
Geller, 1965), relatively little attention has 
been devoted to a comparable study of per- 
formance aids required to facilitate lunar 
maintenance and operational tasks (Holmes, 
1965). Based on the results of the present 
research, it may be just as necessary for the 
astronaut to anchor himself to his task in 
the lunar environment as it is for earth orbital 
operations. 

The following represents guidelines for 
unique design conceptualization of perform- 
ance aids. Handholds should be provided for 
the astronaut primarily as a method for 
obtaining a satisfactory position relative to 
the task. However, if possible, the astronaut 
should not be required to use his hands in 
steadying his position relative to his equip- 
ment task since this would limit his ability 
to use both hands in accomplishing tasks. It 
appears that a one-hand grip would not be 
sufficient to steady his position for many 
tasks. Properly spaced and distributed toe 
holds and tether anchor points appear to 
offer promise in helping to steady the astro- 
naut’s position for tasks involving force ap- 
plication such as the torquing task. Holmes 
(1965) investigated tether concepts in lunar 
gravity for maintenance tool work and con- 
cluded that a single tether located from the 
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navel area of the suit to the task was the 
best candidate. Further research is required. 
It has also been pointed out that the pres- 
surized suit prohibited performance in ex- 
treme positions. In addition to the tethers and 
handholds just cited, extravehicular work 
may require lunar ladders or toe holds on 
the side of structures where work is required 
above the astronaut’s shoulders. Also, work 
areas extremely close to the lunar surface 
should be avoided where possible. 
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MEMORY DYNAMICS AND WORK MOTIVATION`+ 
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Washington University 


Of 143 workers asked to report their pleasant and unpleasant experiences, only 
24 included work as 1 of their outstanding pleasant experiences. These 24 com- 
posed the work-motivated group. A group equated for age and job level was 
selected as a control group. The 2 groups were compared on measures of: 
(a) memory optimism, determined by predominance of pleasantness over un- 
pleasantness; (b) ratings of their adequacy as workers; (c) autonomy; (d) in- 
terpersonal competence. Significant differences were found in favor of the work- 


motivated group: 


they were more often memory optimists and superior in 


work adequacy, autonomy, and interpersonal competence. The relationship of 
these findings to the motivation-to-work controversy is considered. 


About work motivation and satisfaction al- 
most everything has been said, but not every- 
thing is known. Since Herzberg, Mausner, and 
Snyderman (1959) published their book, Tke 
Motivation to Work, in which they advocated 
a two-factor theory of job satisfaction, Herz- 
berg and his supporters have published many 
articles to prove this theory. However, other 
contributors studying the same problem are 
not at all convinced that Herzberg is right. 
Evidence negative to Herzberg’s theory has 
been reported in studies by Dunnette (1965), 
Ewen (1964), Friedlander (1963), and others. 
A review of the literature indicates that those 
studies which best support Herzberg’s theory 
are those which use the same or similar meth- 
ods. Quinn and Kahn (1967), in their review 
of organizational psychology, have this to 
say about the Herzberg controversy: 


Whereas Herzberg has rested a stimulating theory of 
job satisfaction on a rather shaky empirical founda- 
tion, the Cornell group (1963-65) have launched an 
intensive series of studies of the measurement of job 
satisfaction with only a minimum of theory to guide 
them [p. 456]. 


According to the Cornell researchers (Ewen, 
Smith, Hulin, & Locke, 1966), their conclu- 
sion is that neither the Herzberg theory nor 
the traditional theory are supported by the 
data they investigated. Instead, results indi- 
cate that intrinsic factors are more closely 


1 Based on a report presented at the Divison 14 
session on Organizational Behavior at the American 
Psychological Association, Washington, D. C., Sep- 
tember 9, 1967. 

Now at Concordia Senior College, Fort Wayne, 
Indiana. 


related to overall satisfaction and overall dis- 
satisfaction than the extrinsic factor, pay, 
and suggest that functioning of the extrinsic 
variable may depend on the level of satisfac- 
tion with the intrinsic variables. In the litera- 
ture on work motivation by Herzberg and 
protagonists, as well as by his antagonists, the 
problem of personality differences between a 
motivated group and a nonmotivated group 
has not been studied. 

Vroom (1962) hypothesized and confirmed 
that if a person believes himself to possess an 
ability and believes that successful perform- 
ance of his task requires that ability, he will 
prefer performing the task effectively to per- 
forming it ineffectively. If he believes effective 
performance is irrelevant to his ability, he will 
be indifferent about performing. If effective 
performance is negatively relevant, he will 
prefer to perform ineffectively. 

Porter (1963a, 1963b), in a series of studies 
of managers, showed that their satisfactions 
were substantially related to such variables as 
level of job, line-staff type of position, and 
size of company. His studies are based on 
a modified form of Maslow’s hierarchy of 
needs. In that sense it can be considered a 
problem of motivation and satisfaction. How- 
ever, he, too, has neglected to study person- 
ality characteristics of work-motivated people. 

In the first study on the problem of memory 
dynamics of workers, Meltzer and Ludwig 
(1967) found that, though younger workers 
are significantly more often memory optimists 
than older workers, each age group signifi- 
cantly expressed pleasant memories more often 
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than unpleasant. These findings were inter- 
preted as suggesting that personality factors, 
rather than age factors as such, influence 
memory dynamics involved. In a study still 
unpublished, a relationship was found be- 
tween memory dynamics of workers and cer- 
tain personality variables. While working with 
the memories reported, it was apparent that 
only a few included work as one of their out- 
standing pleasant experiences of a lifetime. Of 
the 143 workers, only 24, or 16.78%, included 
work as one of their outstanding pleasant ex- 
periences. It dawned on the investigators that 
here, if ever, was a work-motivated group. 
Not only did work have intrinsic satisfaction 
to them, but work evidently was ego-involv- 
ing enough to be included in pleasant experi- 
ences of a lifetime. It would be interesting to 
compare workers of that kind with those to 
whom work did not have that kind of ego- 
involving meaning. Accordingly, it was ar- 
ranged to compare an equated group with 
them on four measures—memory optimism, 
work adequacy, autonomy, and interpersonal 
competence. 

The purpose of the present study was to 
fill, in some small measure, the gap in knowl- 
edge about the relationship of work satis- 
faction to personality characteristics by com- 
paring a work-motivated group with a non- 
work-motivated group in certain personality 
variables, including memory dynamics. The 
problems studied, expressed in the form of 
hypotheses, were: (1) More of the work- 
motivated group will be memory optimists; 
(2) the work-motivated group will be sig- 
nificantly higher in work adequacy than the 
non-work-motivated group; (3) the work- 
motivated group will be significantly superior 
in the two personality variables measured 
autonomy and interpersonal competence. 


METHOD 
Subjects and Setting 


The Ss used in this study were 143 workers in a 
paper converting industry located in a stable college 
community in upstate New York. This paper mill is 
the only plant of any size in the community, and 
obtaining a job there is considered to be a sign of 
establishment to many people living in the com- 
munity. For the most part, Ss were semiskilled 
workers with large families and strong feelings about 
their jobs in the industry. The Ss ranged in age from 


19 to 78 and represented about one-third of all the 
workers in the plant. The 143 were selected from a 
larger group because they were the only ones inter- 
viewed for data concerning pleasant and unpleasant 
experiences, All of the workers who mentioned their 
work experiences as a part of their pleasant memories 
were classified in the work-motivated group. A 
group matched to these 24 workers was selected from 
the remaining 119 workers by equating them for 
age and job level. 


Source Material 


In the interview as structured, the directed ques- 
tioning about work, family, personal story, and 
present conditions and problems was presented in 
open-end fashion. The more projective phases 
(Meltzer, 1950) of the interview, which in this study 
include the recall of outstanding pleasant and un- 
Pleasant experiences of a lifetime, were given after 
the factual phase of the interview was completed, 


Procedure 


The interview was structured in the following 
sequence: 


1, Direct questioning about work, family, personal 
story, and present conditions or problems. 

2. Recall of the most outstanding pleasant experi- 
ence with the question, “And now tell me what comes 
to your mind when you think of the most outstand- 
ing pleasant experience of your life.” 

3. Recall of other outstanding pleasant experiences. 

4. Recall of the most outstanding unpleasant ex- 
perience. 

5. Recall of other outstanding unpleasant experi- 
ences, 


Memory optimism was determined by predominance 
of pleasantness over unpleasantness in the experi- 
ences given by the S; memory pessimism was de- 
termined by the predominance of unpleasantness 
over pleasantness. In the original article, however, 
wishful and eternal optimists were differentiated from 
realistic optimists (Meltzer & Ludwig, 1967). 

Work competence was based on the rating by the 
interviewers at the end of the interview. The inter- 
viewers had available the ratings of the respective 
foremen of the Ss studied to consider as an aid in 
making their judgments. Scores for various ratings 
were as follows: 1 for D; 2, D+; 3, C; 4, C+; 
5, B; 6, B+; 7, A; 8, At. 

Autonomy was determined on the basis of a con- 
tent analysis of the personal and work histories given 
by the S, including projective interview material. 
The S was given a “+” in autonomy if the histories 
contained statements indicating adequate security 
and adjustment; a “—” indicated insecurity and mal- 
adjustment; and a “0” was assigned if no clear as- 
sessment could be made. A second judge made inde- 
pendent ratings of the same factors, with a correla- 
tion of 6 between the two. (A 3X3 contingency 
table was used with the maximum correlation being 


82.) 
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Interpersonal competence was also determined by a 
content analysis of the same source material, namely 
the work and family histories, plus projective re- 
sponses to memories of pleasantness and unpleasant- 
ness. An S was given a “+” in interpersonal com- 
petence if the histories contained statements indi- 
cating that S got along well with others. If there 
were statements indicating a history of poor rela- 
tions with others a “—" was given, A “O” was given 
if no clear judgment could be made. The rating of 
the second judge correlated .64 with the first, (A 
3 X 3 contingency table was used with the maximum 
correlation being 82.) 


RESULTS 


Difierences in Memory Optimism and Work 
Adequacy (Hypotheses 1 and 2) 


The two groups were compared on memory 
optimism and work adequacy, or performance. 
The means, standard deviations, and ¢ values 
for the two groups on memory optimism and 
work adequacy are reported in Table 1. 

These results support the first two hypoth- 
eses; namely, that the work-motivated group 
is composed of people who are more likely to 
be memory optimists, and that the work- 
motivated group rates significantly higher on 
work adequacy. 


Differences in Autonomy and Interpersonal 
Competence (Hypothesis 3) 


The Ss were then compared on the measures 
of personality variables—autonomy and inter- 
personal competence. A comparison was made 
in a 2 X 2 fashion with the chi-square values 
as a measure of significant difference. The 
relevant data for the comparison of the two 
groups are presented in Table 2. On the mea- 
sure of autonomy, the work-motivated group 
rated significantly higher than the matched 
group. On the measure of interpersonal com- 


TABLE 1 


DIFFERENCES BETWEEN WorK-Motivatep GROUP AND 
CONTROL GROUP on Memory Oprmism 
AND WORK ADEQUACY 


Factor Group SD |t value 
Memory 
optimism Work-motivated 
3.47* 
Non: work. motivated 
Adequacy Work-motivated 
2.89% 
Non-work-motivated 1 
p< .01. 
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TABLE 2 


Dirrerexces BETWEEN Wonx-Morwarrp Grour ann 
CONTROL GROUP on AUTONOMY AND 
INTERPERSONAL COMPETENCE 


| 


Gr Positive! Negative 
Factor p factor fact of * 
Autonomy Work · motivated u 3 T 
9.03% 
Non-work- 
motivated s 16 
Interpersonal 
competence Work motivated i 6 | 
4.04° 
Non-work- — 
motivated 5 Ero? 
*p <05. 
* l. 


petence, the work- motivated group was rated 
more favorably than the matched group. These 
two findings support the third hypothesis; 
namely, that the work-motivated group is 
significantly superior on the two personality 
variables -autonomy and interpersonal com- 
petence. ` 


DISCUSSION 


Significant characteristics which differenti- 
ate the work-motivated group from the con- 
trol group are revealed in this study. In every 
respect measured, the work-motivated group 
comes out more favorably or superior than 
the equated control group of fellow workers. 
The work-motivated group contains signifi- 
cantly more realistic memory optimists, who 
are rated more adequate as workers on the 
job, are more autonomous or secure and ad- 
justed, and are more competent in their in- 
terpersonal relations. The two groups are 
greatly different in memory dynamics and job 
adequacy as well as personality variables. In 
this plant, at least, people who are work 
motivated as evidenced by their ego involve- 
ment in the form of expressed feelings in their 
life memories are identified by better per- 
formance, better adjustment, and more op- 
timistic outlook on their past lives. The pro- 
cedure described here has been and can be 
used to advantage for management-evaluation 
purposes. For research purposes it would be 
worthwhile to compare the findings based on 
a similar study with the results obtained on 
the same workers with the use of the Critical 
Incident Approach of Herzberg. Such explora- 
tion opens up problems of significance, the 
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SUPERVISORY INFLUENCE AND WORK-GROUP 
PERFORMANCE 


KURT R. STUDENT 


Center for Research on Utilization of Scientific Knowledge, University of Michigan 


Incremental influence, a new concept for the study of organizational leadership, 
was empirically tested by relating 12 work-group performance measures with 
measures of supervisors’ social power. Referent power and expert power were 
conceptualized as incremental influence and considered qualitatively different 
from reward power, coercive power, and legitimate power. Referent power 
and expert power are considered to be idiosyncratic in character and depend 
upon an individual's unique role behavior, while reward power, coercive power, 
and legitimate power are organizationally determined and designed to be 
equal for all supervisors on the same hierarchical level. As predicted, expert 
power was related to a low accident rate, referent power was somewhat 
related to measures of subordinate withdrawal, and both referent power and 
expert power were generally related to group measures of production per- 
formance. These results are regarded as supporting the concept of incremental 


influence. 


A recent book by Katz and Kahn (1966) 
has suggested a new concept for the study of 
organizational leadership. The concept, in- 
cremental influence, is stated as follows: 


When we think of leadership in contrast to routine 
role performance, however, we become particularly 
interested in those kinds of individual behavior 
which go beyond required performance and realize 
more fully the potential of a given position for 
organizational influence. In other words, we con- 
sider the essence of organizational leadership to be 
the infiuential increment over and above the 
mechanical compliance with routine directives of 
the organization [p. 302]. 


The present research is an empirical investiga- 
tion of the concept of incremental influence in 
an industrial organization. More specifically, 
this research will analyze some performance 
correlates of the first-line supervisors’ in- 
cremental influence. 

A five-fold typology of social power sug- 
gested by French and Raven (1959) was 
used as the framework for the author’s 
analysis of supervisory influence. Although 
other bases of power could be cited, French 
and Raven specify these five as the primary 
by which an agent, O, can exert influence 
over a person, P: (1) Reward power, based 
on P's perception that O has the ability to 
mediate rewards for him. In order for O to 
use reward power to influence P, he must 
possess resources that P values, and P must 


believe that by conforming he will actually 
benefit by the resources in question. (2) 
Coercive power, based on P’s perception that 
O has the ability to mediate punishments for 
him. Analogous to the situation for reward 
power, in order for O to use coercive power 
to influence P, P must want to prevent some 
act of O and believe that through conforming 
he will do so. (3) Legitimate power, based on 
internalized values in P that O has a right 
to influence P and that P has an obligation to 
accept this influence. (4) Referent power, 
based on P’s identification with O. French 
and Raven define identification as “a feeling 
of oneness of P with O, or a desire for such 
identity.” If O is a person toward whom P 
is attracted, P will have a desire to be closely 
associated with O. This identification of P 
with O can be established or maintained if 
P behaves, believes, or perceives as O does. 
(5) Expert power, based on P’s perception 
that O has some special knowledge or expert- 
ness in a given area. French and Raven hold 
that the strength of expert power varies with 
the extent of knowledge which P attributes 
to O and that the range of expert power is 
restricted to those cognitive areas in which 
O is believed to have superior knowledge. 
Experience, training, reputation, demonstrated 
ability, etc. are among the many reasons 
why P attributes expertness to O. 
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This five-fold typology of the bases of 
supervisory power makes possible an im- 
portant conceptual distinction between the 
bases of power and leads to an operational- 
ization of the concept of incremental in- 
fluence. The areas in which reward power, 
coercive power, and legitimate power may 
be exercised are largely specified by the 
organization; on the other hand, areas in 
which the supervisor can exercise referent 
power and expert power are to a substantial 
degree uniquely determined by his own be- 
havior and his interactions with his sub- 
ordinates (expert power is personally de- 
termined, but the amount of information 
available may vary as a part of organiza- 
tional structure). The extent and range of a 
supervisor’s referent power and expert power 
cannot be specified by the organization. 
Expert power and referent power are idio- 
syncratic in character. On the other hand, 
reward power, coercive power, and legiti- 
mate power are nomothetic in char- 
acter and result from the supervisor’s oc- 
cupancy of a position in the organization’s 
role system. The formal organizational struc- 
ture is designed to provide equal legitimate 
power to all supervisors on the same hier- 
archical level and to give them equal access 
to the use of organizational rewards and 
punishments. However, supervisors are not 
equal in their referent power or their ex- 
pert power; the supervisor’s referent power 
and his expert power constitute his particular 
utilization of his formal position. Thus the 
author concludes that referent power and 
expert power are conceptually and qualita- 
tively different from reward power, coercive 
power, and legitimate power. Influential acts 
based on referent power and expert power 
constitute an increment in organizational in- 
fluence beyond the influence inherent in the 
routine functioning of the organization’s role 
system. Consequently, referent power and 
expert power are the basis of incremental 
influence, and incremental influence has been 
Operationalized as the additive combination 
of referent power and expert power. 

The present research relates incremental 
influence to work-group performance. Since 
recent research in a variety of organizational 
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settings casts doubt on the validity of job 
performance as a unidimensional construct, 
a multiple-criteria approach to work-group 
performance was used. Supervisory influence 
will be related separately to 12 performance 
measures. These measures were conceptual- 
ized by a framework suggested by March and 
Simon (1958) who state that organizational 
members face two different decisions—the 
decision to participate and the decision to 
produce. Consequently, the performance 
measures are considered as measures of with- 
drawal, or as measures of production; ac- 
cident rates are considered separately since 
this criteria does not fit into the withdrawal- 
production classification. An impressive body 
of research evidence (Kahn & Katz, 1960; 
Likert, 1961; Mann, 1965; Pfiffner, 1955) 
suggests that incremental influence (i.e., both 
referent power and expert power together) 
would relate to production measures. On the 
other hand, March and Simon (1958) hy- 
pothesized that withdrawal measures relate 
to satisfaction but not to performance, which 
in the author’s terms would suggest that 
withdrawal measures relate mainly to referent 
power. Two studies support this view. Fleish- 
man, Harris, and Burtt (1955) found a 
negative relationship between supervisory 
consideration and absenteeism; Fleishman 
and Harris (1962) found a negative relation- 
ship between supervisory consideration and 
turnover. Finally, in terms of the analytical 
model, the author considers that accidents 
are unintended consequences resulting from 
low supervisor “technical competence“ 
(Mann, 1965, defines technical competence 
as the ability to use pertinent knowledge, 
methods, techniques, and equipment neces- 
sary for the performance of specific tasks 
and activities) and the inability of the super- 
visor to impart his competence to his sub- 
ordinates. Accordingly, one would expect 
that accidents relate mainly to supervisory 
expert power. Consequently, the operational 
hypotheses were that incremental influence 
(i.e., both referent power and expert power 
together) would relate to the production 
measures; referent power would relate to the 
withdrawal measures; expert power would 
relate to the accident rate. 
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TABLE 1 


RELATIONSHIPS BETWEEN MEASURES OF SUPERVISORY INFLUENCE 


an 


Measure 1 | 2 3 | 4 | | 
1. Incremental influence 
2. Referent power AS bay! 
3. Expert power etre age" 
4. Reward power 4°" 32 43˙⁷ 
5. Coercive power 12 12 09 AS** 
6. Legitimate power 35* 38* 24 08 41 


Note.—Group means, N = 40. 
*p <.05, two-tailed. 

** p <.01, two-tailed. 

* p < .001, two-tailed 


METHOD 
Research Site and Sample 


This study employed data collected from a 
manufacturer of major home appliances. The data 
were collected for a larger study sponsored by the 
company and conducted by the Survey Research 
Center. The sample consisted of 486 hourly em- 
ployees and 39 first-line supervisors representing 40 
of the 52 work groups which comprised the com- 
pany’s two main plants, and the unit of analysis 
in this research was the work group. Measurements 
of attitudes, opinions, and perceptions of behavior 
were collected by paper-and-pencil questionnaires 
administered to nonsupervisory production em- 
ployees during March 1964. 


Questionnaire Measures of Supervisory In- 
fluence 


The measures of supervisory influence were as 
follows: 


Referent power. A single-item rating by sub- 
ordinates of the extent to which they comply with 
their supervisor’s directives because he is a “nice 
guy” and they don’t want to hurt him. 

Expert power. A single-item rating of the ex- 
tent to which subordinates comply with their 
supervisor’s directives because they respect his ex- 
perience and good judgment. 

Reward power. A single-item rating of the extent 
to which subordinates comply with their supervisor’s 
directives because he can give special help and 
benefits to those who cooperate with him. 

Coercive power. A single-item rating of the ex- 
tent to which subordinates comply with their 
supervisor’s directives because he can penalize 
or make things difficult for those who do not co- 
operate with him. 

Legitimate power. A single-item rating of the ex- 
tent to which subordinates comply with their super- 
visor’s directives because he has a right, consider- 
ing his position, to expect subordinates to do what 
he wants. 


Incremental influence. As noted earlier, this con- 
cept was operationalized by summing the group 
mean for referent power and the group mean for 
expert power. 


Table 1 presents the relationships between these 
measures of supervisory influence, while Table 2 
presents the means and standard deviations of in- 
cremental influence and the bases of social power. 


Measures from Company Records 


For purposes of the larger study from which 
the present research data are taken, a roster of 
work-group performance measures was developed 
so that a number of different measures of per- 
formance could be related to questionnaire data. 
Twelve of these work-group performance measures 
were used in the present research. These 12 measures 
were tabulated weekly by the company during & 
17-week period from immediately preceding and 
following the questionnaire administration. Six of 
the measures were direct measures of frequency or 
amount, and these measures were converted to a 
per- employee basis for each work group; the other 
six measures were based on ratings of each work 
group by the department head. The ratings were on 
4. point descriptive scales based on the firm’s ex- 
isting budgetary, allowance, or schedule standards. 
Actual group performance was compared against 


TABLE 2 


MEAN AND STANDARD DEVIATION OF INCREMENTAL 
INFLUENCE AND BASES OF POWER 


Measure M SD 
Incremental influence 6.12 92 
Referent power 2.73 47 
Expert power 3.30 59 
Reward power 2.78 40 
Coercive power 2.32 42 
Legitimate power 3.43 44 


Note.—Group means, N = 40. 


Supervisory INFLUENCE AND Worx-Grour PERFORMANCE 


TABLE 3 


Listinc anp Descairrion or Worx-Gaour 
PERFORMANCE Measures 


u 


Rating measures of performance 


Indirect costs (+) 
Performance rating based on actual versus budgeted 
amounts of inspection, stockhandling, general labor, 
make-up allowances, and rework labor (4-point scale), 
Maintenance costs (+) 
Performance rating based on actual versus budgeted 
amounts for maintenance, considering both type and 
quality of maintenance orders (4-point scale). 
Supply costs (+) 
Performance rating based on actual versus budgeted 
allowance for supplies (4-point scale). 
Scrap costs (+) 


Performance rating based on actual versus budgeted 
amounts for scrap and rework (4-point scale), 


Performance against schedule (+) 
Performance rating based on the extent to which the 
work group: stayed on or ahead of schedule (Rat- 
ing = 4), required help to stay on schedule (Rating = 
3), was behind schedule (Rating = 2), or was behind 
to the extent of causing down-time (Rating = 1). 


Quality (+) 
Performance rating based on actual versus antici- 
pated numbers of rejects, taking into consideration 
the disposition, amount, and kind of reject (4-point 
scale), 


Average earnings (++) 
The average earnings per hour per man for each 
work group. 


Excused absences (—) 
The number of occurrences of excused absence, 
divided by the number of men in the work group. 


Unexcused absences (—) 
The number of occurrences of unexcused absence, 
divided by the number of men in the work group. 


Accidents (—) 
The number of reported injuries divided by the num- 
ber of men in the work group. 


Turnover (—) 
The number of quits, transfers, and formal bids for 
jobs in other work groups and not involving advance- 
ment, divided by the number of men in the work 
group. 


Suggestions submitted (+) ! 
The number of suggestions formally submitted, 
divided by the number of men in the work group. 

> 8 


Note.—In each case, direction of scale is indicated by a plus 
or minus sign. ‘*+-’’ means that a high numerical rating repre- 
sents good“ performance. 
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these abstract standards. Table 3 lists the 12 work. 
group performance measures taken from the coms- 
Dany records and presents a description of cach 
measure. 

Note that a high score on any 1 of the 12 work- 
group performance measures may indicate either 
excellent or poor performance depending upon the 
specific measure in question 


Resucts 


The first hypothesis predicted that expert 
power would be positively related to a low 
work-group accident rate. The results from 
an analysis of this prediction are presented in 
Table 4 indicating that the hypothesis is sup- 
ported. Expert power is the only one of the 
five bases of supervisors’ power to relate 
significantly to a low accident rate. 

The second hypothesis predicted that meas- 
ures of withdrawal—excused absences, un- 
excused absences, and turnover—would be 
related to referent power. Specifically, the 
author predicted that high referent power 
would be associated with low absences and 
low turnover. The results relating to this 
prediction are presented in Table 5. 

Table 5 indicates that this prediction is 
only partially supported. As expected, referent 
power is associated with low excused absences. 
However, it was found that unexcused ab- 
sences and turnover are not associated as 
expected with supervisory bases of power. 
Surprisingly, high referent power tends to be 
related to Aigh turnover. This latter finding 
is contrary to most previous research which 
would suggest that supervisory behavior 
which is characterized by high referent power 
should result in employees’ continuing in the 
system. In the present case, however, un- 
excused absences are not at all related either 
to referent power or to the other bases of 
supervisory power, while turnover is related 
in a direction opposite to that predicted. 

The third hypothesis predicted high in- 
cremental influence (both referent power and 
expert power) would be related to good 
performance on those measures which were 
considered to represent measures of produc- 
tion. The results from an analysis of this 
prediction are presented in Table 6. 

The predictions of this study are generally 
supported. For four production performance 
measures (indirect costs, supply costs, quality, 
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TABLE 4 
RELATIONSHIPS BETWEEN ACCIDENTS AND SUPERVISORY POWER 


Subordinate work-group’s mean perception of their supervisor's 


Incremental Referent Expert Reward Coercive Legitimate 
influence power power power power power 
Accidents —44 Zü — 28° | — 03 | ~16 | 20 


TABLE 5 
RELATIONSHIPS BETWEEN WITHDRAWAL PERFORMANCE MEASURES AND SUPERVISORY POWER 


Subordinate work-group’s mean perception of their super visor's 


Withdrawal 
. performance 3 F 
measure Incremental Referent Expert Reward Coercive Legitimate 
influence power power power power power 
| Excused absences —.36* —.35* —.28* —.18 16 — 12 
Unexcused absences 00 —.02 02 18 02 —.08 
Turnover 12 .23 —.01 14 08 01 


Note. — Group means, N = 40. 
$p <.05, one-tailed. 


and suggestions submitted) the predicted on average earnings or performance against 
zero-order relationships are significant. For schedule. 
two other production performance measures 

(scrap costs and maintenance costs) the 

predicted zero-order relationships tend toward Although legitimate power ranked first 
significance. However, no relationships exist among the reasons for compliance with super- 
for two predicted relationships; incremental visory directives, legitimate power is not 
influence is not related to good performance related to differences in the performance of 
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TABLE 6 
RELATIONSHIPS BETWEEN PRODUCTION PERFORMANCE MEASURES AND SUPERVISORY POWER 


Subordinate work-group’s mean perception of their supervisor’s 


Production performance 


W Incremental] Referent Expert Reward | Coercive | Legitimate 

influence power power power power power 
Indirect cost performance 2 A0** 10 AS 22 .00 
Maintenance cost performance* 12 00 18 —.20 — 307 10 
Supply cost performance B 2¹ 822 ais 08 08 
Scrap cost performance 25 33˙ 13 .26 12 06 
Performance vs. schedule —.1¹ .05 —.21 —.06 04 —.05 
Quality 36* 32* ole 13 —.08 11 
Average earnings O1 00 0¹ — 40 22 05 
Suggestions submitted .28* 36" 14 09 40** 10 


Note.—Group means, N = 40. 
a Mean for 29 groups used as measure for mean of remaining 11 groups. 
* p <.05, one-tailed. 

* S. 01, one-tailed. 
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these subordinate work groups. The data 
suggest, therefore, that when the supervisor 
has a choice in the bases of power he em- 
phasizes, to achieve compliance with his di- 
rectives the supervisor should attempt to 
create and to use referent power or expert 
power rather than rely too strongly on his 
legitimate power. 

Reward power and coercive power, unlike 
legitimate power, do relate to a number of 
performance measures. However, the overall 
relationship between these two bases of power 
is two-sided. From the organization’s point 
of view the correlates of reward power and 
coercive power are problematic and have un- 
intended consequences for the organization. 
Organizationally speaking, reward power re- 
lates functionally to supply costs but dys- 
functionally to average earnings; coercive 
power relates functionally to suggestions sub- 
mitted but dysfunctionally to maintenance 
costs. Therefore the data suggest that the 
supervisor’s use of reward power and coercive 
power should be tempered where possible by 
the creation of referent power and expert 
power. Since there appear to be no dysfunc- 
tional or organizationally unintended cor- 
relates of incremental influence, the latter 
should be created and used where possible. 

As was noted above and as predicted, in- 
cremental influence has positive and organiza- 
tionally functional correlates. However, it is 
important to note that only 7 of the 12 per- 
formance measures relate to incremental in- 
fluence. Five measures, two withdrawal meas- 
ures (unexcused absences and turnover) and 
three production measures (maintenance 
costs, performance against schedule, and 
average earnings), showed no relationship to 
incremental influence. The reason for the 
difference in relationship between incremental 
influence and performance is not altogether 
clear. Perhaps the five performance measures 
which do not relate to performance represent 
areas or factors over which the first-line 
supervisor has little or no control. Unexcused 
absences and turnover might be related to 
extraorganizational considerations or to or- 
ganizational factors which are unrelated to 
compliance with the first-line supervisor. The 
three production measures which do not re- 
late to incremental influence may be said 


to represent company-wide or system factors 
rather than to be related to factors at 


nance programs, production schedules, and 
pay rates, respectively. 

Finally, it should be noted that the correla- 
tions for referent power and expert power are 
not generally additive, and referent power 
is a better predictor of performance than is 
expert power. Referent power is significantly 
related to five performance measures (excused 
absences, indirect costs, quality, suggestions 
submitted, and scrap costs), while expert 
power relates significantly to four measures 
(accident rate, excused absences, supply 
costs, and quality), Only in the case of 
quality is the correlation for incremental 
influence greater than the correlations for 
referent power and expert power, which sug- 
gests that in this case the bases of power 
are additive. 

The present research has shown the con- 
cept of incremental influence to be a useful 
and viable one. The qualitative distinction 
between referent power and expert power 
on the one hand and reward power, coercive 
power, and legitimate power on the other 
was generally supported by the data. The 
performance correlates of referent power and 
expert power are surprisingly large in view of 
the fact that the research did not include 
extraorganizational factors and included only 
one level of management. Moreover, the first- 
line supervisor in the present research site is 
a supervisor who may be said to have a 
relatively small area of freedom; his role 
range is limited by industrial engineers, 
quality-control specialists, time and motion 
study men, and others, as well as by the 
standardized nature of high-volume, single- 
product production. In view of the limitations 
inherent in the role which was studied, the 
findings are encouraging and further research 
and validation of these findings are indicated. 
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INFLUENCE OF TRAINING, METHOD, AND RELATIONSHIP 
ON THE HALO EFFECT 


EVA METZGER BROWN * 
Teachers College, Columbia University 


120 student nurses each rated 6 peers on 10-point scales for é sts of 
The variance of a judge's ratings for each object was found, and the 
scores of the 6 objects for each judge summed. Thi dispersion score 
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as the measure of the halo effect. It was found that trained judges exhibited 
less halo than untrained judges, but that ratings made by rating 1 individeal 
on 6 traits at a time yielded substantially the same amount of halo as did 
ratings made by rating all individuals on 1 trait at a time, The relationship 
between the judge and the object was a significant variable for the untrained 
judges. Here the ratings for the less intense relationship exhibited significantly 
more halo than the ratings for the intense relationship, Training of the judges 
improved all ratings to the point where the judge-object relationship did not 


play a significant role. 


The halo effect poses a problem for the 
investigator because of its pervasive yet 
elusive presence in the rating situation. It 
occurs when “a particular rater tends to rate 
a particular ratee similarly on all traits” 
(Guilford, 1959, p. 146), for any reason. The 
halo effect was noted briefly by Wells in 
1907 and labeled by Thorndike in 1920. 

Statistically, the halo effect has been de- 
fined in three ways: (a) as an inflated inter- 
correlation among traits (Gilinsky, 1947; 
Taylor & Hastman, 1956), (b) as a general 
bias factor arrived at through matrix and 
factor analyses (Grant, 1952; Meyers, 1965), 
and (c) as the judge-object (-O) inter- 
action error evaluated according to Guilford’s 
(1954) analysis-of-variance model. The first 
definition, although responsible for the orig- 
inal concept of the halo effect, is now viewed 
as lacking in sophistication. The findings of 
factor-analytic studies, while of value in 
identifying halo, are limited to a specific ex- 
perimental situation. The third definition is 
the most sophisticated in that it differentiates 
between objective (valid) and relative (in- 
valid) halo, and the more useful in that it is 
applicable to all studies where a number of 


1The research reported in this paper is based on 
a doctoral dissertation submitted to Teachers Col- 
lege, Columbia University. Special thanks are given 
to Laurance F. Shaffer, chairman of the dissertation 
committee, Elizabeth P. Hagen and Rosalea A. 
Schonbar, members of the committee, and to 
Rosedith Sitgreaves, consultant to the project. 


Js are rating common Os on a common scale. 
Although the data of the present study were 
not based on ratings of common Os, the 
conception of Guilford (1959) was used to 
define halo as the J-O interaction error and 
to apply the analysis-of-variance technique. 
In the study, the halo effect was defined as 
inversely proportional to the variance score 
for ratings given an O by a J over six sets 
of traits. 

The main purpose of this study was to 
identify the conditions which minimize the 
operation of the halo effect. The study ex- 
amined the influences of the training of the 
Js, of the method of rating, and of the 
quality of the -O relationship. Previous re- 
search has indicated that Js benefit from 
training when it includes: (a) practice with 
the specific rating scale (Wakeley, 1961), 
(b) discussion of errors in rating by the Js 
(Levine & Butler, 1952), and (c) special 
emphasis on the importance of trait differ- 
entiation (Taylor & Hastman, 1956). As for 
the method of rating, Symonds (1925) sug- 
gested that more halo results when a J rates 
one individual at a time on all traits (Method 
1), than when a J rates all individuals on one 
trait at a time (Method 2). Method 2 has 
been widely accepted as superior to Method 
1, but recent studies (Blumberg, DeSoto, & 
Kuethe, 1966; Johnson, 1963; Taylor & 
Hastman, 1956) have reported substantially 
the same degree of halo effect for both 
methods. Characteristics of the -O relation- 
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ship which influence the J’s ratings of the O 
have not been studied adequately. Greater 
attention has been focused on the level of 
acquaintance between the J and the O (Bare, 
1954; Koltuv, 1962), and the data support 
the hypothesis that increased acquaintance is 
related to decreased bias in judgment. 

The broad hypothesis of the study is that 
the operation of the halo effect is influenced 
by the training of the Js, by the method of 
rating used by the Js, and by the intensity 
of the relationship between the J and the O. 


METHOD 
Subjects 


One hundred and twenty student nurses in at- 
tendance at St. Luke's Hospital School of Nursing, 
New York, served as subjects. They were divided 
into four groups: Al, A2, BI, and B2. Groups A 
were trained, whereas Groups B were not trained; 
Groups 1 made ratings according to Method 1, 
whereas Groups 2 made ratings according to 
Method 2. 


Instruments 


The instruments were designed and adapted to 
serve the needs of the study, and written instruc- 
tions preceded each questionnaire. 

The Information Questionnaire was used to 
gather some background information on each J. 
The Form for the Selection of Objects required 
each J to select three Os toward whom she felt 
as favorable as possible (intense relationship) and 
three Os toward whom she felt indifferent (less 
intense relationship). All Os were fellow students. 

The Acquaintance Checklist assessed how well 
the J felt she knew the O, while the Social Dis- 
tance Scale measured how well the J liked the O. 

The Rating Scale consisted of six bipolar traits 
(factors) on 10-point scales. The factors, selected 
from the Guilford-Zimmerman Temperament Survey 
(Guilford & Zimmerman, 1949), included lethargic- 
active, impulsive-self-controlled, withdrawn-sociable, 
emotionally unstable-emotionally stable, mannish— 
feminine, and hostile-friendly. 

The Postexperimental Reaction Questionnaire 
tapped the J’s knowledge of the aims of the study 
and her previous experience in rating others. 


Procedure 


The data were collected in four 1-hr. periods 
spaced 1 wk. apart. No more than 30 Js were seen 
per session and no more than 15 Js were trained 
at a time. 

In Session I a broad outline of the procedure 
relevant to the trained and untrained groups was 
given, and the confidentiality of ratings was stressed. 
The Js completed the Information Questionnaire and 
selected their Os. 
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In Session II, Group A was trained, while 
Group B did not meet. The experimenter, acting 
as an instructor, told Group A about different kinds 
of rating scales and rating procedures, problems in 
obtaining sound ratings, and constant errors. The 
Js were given some practice on the Rating Scale 
and their ratings were discussed. 

In Session III, the Js completed the Acquaintance 
Checklist, the Social Distance Scale, and the Rating 
Scale, In Session IV, these three scales were re- 
administered and their test-retest reliabilities found 
to be 95, 96, and .85, respectively. Finally, the 
Postexperimental Questionnaire was completed. 


RESULTS 
Characteristics of the J-O Relationship 


Student nurses were selected as subjects 
because there was a basic level of familiarity 
among them. In the sample used, however, 
91% of the “favorable” Os were known “ex- 
tremely well” or “very well” by the Js, 
whereas 62% of the “indifferent” Os were 
known only “a little” or “hardly.” The cor- 
relation between the Acquaintance Checklist 
and Social Distance Scale was .79. It was 
concluded that the Os who had an intense 
relationship with the J were well known to 
the J, and the Os who had a less intense 
relationship with the J were less well known 
to the J. 


Influence of Training and Method on the 
Halo Effect 


The means and standard deviations for 
the variance scores of the experimental groups 
appear in Table 1. Each mean value was 
based on 30 scores, except for the combined 
“Groups” and “Methods” where the mean 
value was based on 60 scores. Each score was 
the sum of one J’s variance scores for three 
favorable or three indifferent Os, or the sum 
of one J’s variance scores for six Os when 
the relationships were combined. The higher 
the score the smaller was the halo effect, and 
the lower the score the greater was the halo 
effect. 

The influence of training and method on 
the halo effect was evaluated by a two-way 
analysis of variance (Table 2). The signifi- 
cance of the variable of training at the .0! 
level led to the conclusion that a brief but 
specific training period leads to more analytic 
and discriminating judgments. 

The variable of method was nonsignificant, 
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Mans axo STA Devianows or Rn al 
Vagiance Scomts Accomptne To Trait, 

Merion, axo Retatiowsmr 

F 


| M 
Experimental groups 1 


Intense relationship 
x 
3 
Less intense relationship 
3 
Both relationships 
7 
s 11.24 


Uni 


Intense relationship 
s 
les intense relationship 


s 
Both relationships 
x 


“indicating that ratings made according to 


Method 1 and Method 2 did not differ sub- 
stantially from one another. A significant 
interaction between training and method at 
the .05 level was found to exist. A further 
evaluation of differences using Scheſſé's 
method indicated that the trained and un- 
trained groups differed significantly only 


when they made ratings according to Method 


1. Hence, the significant interaction reflected 
the effect of training on Method 1. 


TABLE 2 


Two-Way ANALYSIS OF VARIANCE: THE INFLUENCE OF 
TRAINING AND METHOD ON THE Hato Errect 


Source of 

variation af SS 
Training 1 935.36 
Method 1 103.50 
Interaction 1 341.32 
Within groups 116 | 9985.31 
Total 119 | 11365.49 

* = 3.92. 


b Fs = 6.85. 


197 


infiuence the J-O 
— — Relationship om the 


582 
giles 
Feri 


E 
Ẹ 
í 
z 
: 
f 
Ba 
rst 


38 EE 
irp 
filer 
alist 
PAR 

HHTH 

i Ha 

2858 850 


: 
i 
F 


s level of significance. A difference between 
TABLE 3 


CHI-SQUARE: THE INFLUENCE or Intenstry or RELA- 
TIONSHIP ON THE Hato Errecr wrr 
Trarsinc Herp Constant 


198 


the intense and less intense relationship was 
thus found only for the untrained group. 
An examination of the mean variance scores 
(see Table 1) indicated that for the un- 
trained group the ratings of the less intense 
relationship exhibited a greater amount of 
halo, while for the trained group the ratings 
became differentiated to the point where the 
relationship between the J and the O did not 
play a significant role. 


Discussion 


The conception of the measure of halo as 
a variance score was a new one. The variance 
score was a direct statistical translation of 
the definition of halo which stated that “a 
particular rater tends to rate a particular 
ratee similarly on all traits” (Guilford, 1959, 
p. 146). With the variability of ratings 
built into the basic score, individual differ- 
ences in the relative amount of halo among 
Js could be identified. In addition, group dif- 
ferences could be found through the analysis- 
of-variance technique so that conditions 
which functioned to increase or decrease the 
amount of halo in ratings could be identified. 
This procedure is practical and can be applied 
to any setting where Js are rating Os on 
graphic rating scales. However, a shortcoming 
of the procedure is that the statistical signifi- 
cance of the objective and relative components 
of the halo effect cannot be determined in- 
dependently as in Guilford’s (1954) model. 
One has to assume that a large difference in 
variance scores indicates significant variation 
in the relative component of the halo effect. 
Such a difference was found between the 
trained and untrained groups. 

Training small groups of inexperienced Js 
was very effective in reducing the amount 
of halo even when such training was brief 
(1 hr.). The fact that the subjects were ac- 
customed to the role of the student and ac- 
cepted the experimenter as an instructor 
probably had a positive influence on the 
findings. 

The training process was affected by two 
uncontrolled features, the first of which led 
to a difference in the observed motivational 
level between the trained and untrained 
groups. It is generally assumed that Js who 
are more involved and more interested in 
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the task will make more careful and more 
accurate ratings (Conrad, 1932). In the 
study, the trained group had more contact 
with the experimenter and more general in- 
formation about the study, It was observed 
that the trained Js wrote more on the Re- 
action re and exhibited greater 
interest in the results of the study than the 
untrained group. Therefore, it is proposed 
that the trained Js had more invested in the 
ratings they were making than the untrained 
Js and hence took more care in making 
discriminating ratings. Further exploration of 
other effects of the training period, apart 
from learning, is required. For example, to 
what degree is the amount of halo in ratings 
affected by increased experimenter-J contact 
or by a desire to please the experimenter? 

The second uncontrolled feature of the 
training process lay in the content of the 
training session and is proposed as an ex- 
planation for the significant interaction found 
between training and Method 1. During 
the training session, practice in rating and 
discussion of ratings were based solely on 
Method 1. It follows that Js who rated 
according to the same method by which they 
were trained could apply what they had 
learned more directly than could the Js who 
rated according to Method 2 and were trained 
according to Method 1. This shortcoming in 
the study points out the importance of having 
the training session equally relevant for all 
the variables in the experiment. 

As for the method of rating, a sizable 
sample of student nurses rated Os with 
whom they were familiar similarly whether 
they used Method 1 or Method 2. Added to 
the accumulated body of research (Blumberg 
et al., 1966; Johnson, 1963; Taylor & Hast- 
man, 1956), the findings affirm the irrelevance 
of this methodological variation for the halo 
effect. Therefore the often suggested recom- 
mendation that ratings should always be 
made by Method 2 (Guilford, 1954, p. 270 
seems poorly founded in the case of graphic 
rating scales. 

The significance of the J-O relationship 
for the halo effect varied with the training 
or lack of training of the Js. For the untrained 
group, the ratings of indifferent Os, the less 
intense relationship, exhibited greater halo 
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than the ratings of favorable Os, the intense 
relationship. As the indifferent Os were kss 
well known than the favorable Os, the fad- 
ings of Bare (1954) and Koltuy (1962), 
that increased acquaintance is 
judgment, 
supported. However, in the 

none of the Os was as “unfamiliar” to 

Js as was the case in Koltuy's study, nor as 
briefly known as was the case in Bare's study. 
The student nurses had been going to school 
and living together for at 
half, so that it could be 
Js had some basis on which to make differ- 
entiated ratings for the indifferent as well as 
the favorable Os. A possible explanation of 
the findings was offered by the Js, themselves. 
On the Reaction reported 
difficulty in selecting Os for the less 
relationship. Apparently, there 
resistance to and anxicty in 
different Os than favorable Os, for both 
trained and untrained group. As the 

relationship did not play a significant 
in the ratings of the trained group, it is 
proposed that the training period not only 
functioned as a learning experience, but also 
tended to reduce the anxiety of the Js and 
hence the restrictiveness of their ratings. The 
latter function would be particularly pertinent 
for ratings of Os toward whom the Js 
“less favorable” and possibly 


halo increase with increased 
or is there a U-shaped distribution where the 
amount of halo is greatest for ratings of Os 
toward whom the J is ambivalent? 
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DRIVER GAP ACCEPTANCE AT INTERSECTIONS 


WILLIAM L. GIBBS 
United States Bureau of Public Roads, Washington, D. C. 


To determine the most probably correct theory of the several currently accepted 


theories of driver gap acceptance, 


the gap limen was investigated by the 


method of constant stimuli. In a field-test situation, drivers were exposed to 
an approaching vehicle for a 1-sec. interval at various times during its 
approach, Approach speeds of 16, 32, 48, and 64 mph were used. Analysis 
of the data indicates drivers accept a gap equal to 5.4 times the approach- 
vehicle velocity (expressed in ft/sec). The obtained function most nearly 
supports the time hypothesis of driver gap acceptance. 


Driver decisions at street intersections re- 
quire a complex series of judgments. The 
driver is required to detect any approaching 
vehicle, determine range and speed of ap- 
proach, then decide whether to cross, or 
merge with, the traffic stream. Although 
drivers accomplish the task with great fre- 
quency and a high degree of safety, com- 
paratively little is known about the actual 
process or processes utilized by the driver. 
Early experimenters (Es) made gross mea- 
surements of yehicle movements, and devel- 
oped theories of gap acceptance based on 
these measurements, but comparatively little 
is known about the actual decision processes 
of the driver at the intersection. 

Development of several different theoretical 
approaches to the driver’s acceptance of gaps 
at intersections has followed the initial study 
by Greenshields (1947). Early Es measured 
and cataloged the gap acceptance of drivers 
in free traffic situations to develop the early 
expressions of the model of driver gap accept- 
ance. Following are the major hypotheses 
relating gap acceptance to various aspects of 
cue structure or cognitive organization of such 
cues. 

1. Time hypothesis: The Greenshields 
model considered gap acceptance judgments 
as if the driver made a precise estimate of 
arrival time of the approaching vehicle. Thus 
a driver would make the judgment that there 
was some time period before the oncoming 
car would arrive at the intersection and thus 
he could, or could not, cross, or merge with 
the mainstream traffic. 

2. Modified-time hypothesis: Evolving from 
the time hypothesis, a more complex expres- 


sion for the time-gap acceptance decision 
process was developed by R. L. Moore 
(1956). Working with pedestrians, he re- 
quired them to make an estimate of the last 
possible moment they would attempt to cross 
a roadway. (They did not actually cross.) 
The function he developed was of the order 
D= 45 + 2.9%, where D = distance in feet 
from the pedestrian to the approaching 
vehicle, and / = approach speed of the 
vehicle in feet per second. His experiment 
covered the range of approaching vehicle 
speeds of 15-60 mph with 100 observations 
at each data point. One particular item of 
interest is that as speed of the approaching 
vehicle (V) increased, there were more po- 
tentially hazardous errors in that insufficient 
time was allowed to cross the street. In fact, 
at 60 mph there were 35 judgments where 
insufficient time was allowed for crossing 
of the 100 total judgments made at that 
speed. 

3. Distance hypothesis: A third model, that 
of constant distance, has been postulated. 
This theoretical approach says that the driver 
will accept a gap of a constant distance re- 
gardless of the speed of the approaching 
vehicle. 

Brain (1962) in a largely theoretical treat- 
ment of gap acceptance considered several 
additional models: 

4. Rate of change of angular position: The 
apparent movement of the approaching vehicle 
in relation to the coordinates of the head of 
the driver in space. Brain showed that these 
cues are of limited value because the rate 
of change is approximately zero when the two 
cars are approaching each other at speeds 
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which would produce a collision, and also 
when the driver's car is halted at the stop line. 

5. Rate of change of horizontal dimension 
due to apparent rotation: As the vehicle ap- 
proaches, the changing appearance of the car 
as it changes from head-on to more and more 
side view may provide the cues to cross or 
not to cross. Brain contends that the values 
derived from the condition of two cars 
on a collision course with the driver’s vehicle 
halted at the stop line will provide insufficient 
cues. This would, in addition, require the 
postulation of a mechanism to develop the 
length/breadth ratio required and then in- 
tegrate the length/breadth ratio over time to 
develop the decision. Brain considers that the 
complexity of such mechanism makes this 
model highly improbable. 

6. Rate of change of dimension due to 
distance: Brain worked with height of the 
vehicle to derive a theory of angular velocity 
similar to that of Michaels and Weingarten 
(1965). Although Brain’s (1962) theoretical 
derivation indicates this model is most de- 
scriptive of the process the driver follows, 
after experimental work he states, “It would 
appear ... that one relies on visual image 
size based on past experience rather than 
calculus for estimating minimum time inter- 
vals [p. 791].” 

7. Experience: The method, according to 
Brain, which is apparently used by drivers. 
The crossing decision is based on known size 
of the visual image and the expected perform- 
ance of the approaching vehicle, and not on 
subtle cues derived from the change or rate 
of change of the image of the approaching 
vehicle. 

For the most part, all preceding studies 
(which used Ss) have used free observation 
wherein Ss could see the approaching vehicle 
until he made his decision. The decision of 
S was thus a summation of information over 
an extended period of time. 

In the present study, an information 
acquisition time (IAT) of 1.0 sec. was 
selected to permit a driver sufficient time to 
make a crossing decision. Use of an IAT 
of 1.0 sec. was considered to have two ad- 
vantages: (1) it appeared to be close to the 
actual decision times utilized by drivers, and 
(2) it permitted a precise determination of 
the threshold of crossing decisions due to the 
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experimental control over the cues used to 
make the gap acceptance decision. 

This experiment was for the purpose of 
determination of the threshold of a driver’s 
gap acceptance by the use of the psycho- 
physical method of constant stimuli in order 
to permit a comparison of the different 
models of crossing behavior which have been 
proposed by several different Es. 


METHOD 
Subjects 


Four male Bureau of Public Roads employees 
who had not had prior contact with the experiment 
acted as Ss. Vision was tested and found to be 
within normal tolerances for all Ss. Age ranged 
from 31 to 62 yr. with an average of $1 yr. Driving 
experience was commensurate with age. 


Apparatus 


Two automobiles were used. One was parked 
at an “intersection” while the other was the stimulus 
vehicle (SV). 

The S was seated in the parked vehicle with his 
vision restricted by an electrically operated shutter 
mounted in the driver's side window of the auto- 
mobile. A chin rest associated with the shutter 
assured precise positioning of S's head. A system of 
pneumatic tubes and electronic timers controlled 
the shutter opening time and duration of 1.0 sec. 
Shutter opening time was controlled for cach run 
by the preset times which were manually inserted 
into the timer before each run. Total time for each 
run was also recorded in order to measure ac- 
curacy of SV approach velocity. 


Procedure 


The testing was conducted on an airport runway 
200 ft. wide and 3000 ft. long. 

With one vehicle “stopped” at a simulated inter- 
section with a through street, the SV approached 
from the left and passed in front of the S in the 
stopped vehicle. The centerline of the SV path was 
approximately 20 ft. from S and perpendicular to 
the centerline of the stopped vehicle. 

Four approach speeds were used for the SV: 16, 
32, 48, and 64 mph. Preset times were so deter- 
mined that the approach times, the time between 
the closing of the shutter and the SV arriving at a 
point directly in front of S, were in the same 
increments for all speeds. Seven approach times 
were used ranging from 1.8 to 9.0 sec. in 1.2-sec, 
steps. 

With 4 speeds and 7 approach times there were 
thus 28 conditions (cells). Although the method of 
constant stimuli was used, a maximum of 10 trials 
was presented for any cell due to time and cost of 
each trial. To further reduce the number of trials 
the method of sequential analysis was used. Necessary 
data for the sequential analysis computations were 
obtained from pilot data. Using cumulative totals 
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for each cell, further trials were discontinued when 
the number of “yes” or “no” responses exceeded 
the limits of the sequential analysis table. 

The task of the S was to observe the approaching 
vehicle and make judgments. The following in- 


structions were read to S: 


We are conducting a controlled study of people's 
ability to determine whether they have appropri- 
ate gaps in traffic so they can cross through 
intersections. Through our simulated intersection 
you are asked to observe the vehicle that is ap- 
proaching just as though you were sitting at a 
stop sign at an intersection. At a certain period 
of time during the approach of the vehicle a 
shutter in this viewing device will open. You 
are to judge during this period whether or not 
you would accept the gap and cross through the 
intersection. Base your decision on the position 
of the car just as the shutter closes. Try to 
make judgments as you normally do when driv- 
ing. Prior to the time when you are requested 
to make a judgment I will say “ready.” At that 
time you should put your chin in the chin rest. 
Shortly afterward you will see the shutter open. 
After the shutter closes you are to indicate by 
saying “yes” if you would have crossed through 
the intersection, or “no” if you would not. 
Between each of these trials you may take your 
chin away from the chin rest and relax. 


Intertrial interval was about 1.5 min. Each S 
received about 100 trials per day with rest periods 
after each 25 trials. During testing, Ss read books 
or magazines between each trial. 


RESULTS 


In order to determine whether there was 
a difference between Ss, a comparison was 
made of the number of acceptance judg- 
ments at each of the four speeds by use of 
the chi-square statistic. The results indicate 
the performance of Ss was different: at 16 
and 32 mph the difference was significant 
at the p < .02, while at 48 and 64 mph the 
difference was Pp C .01. Due to the signifi- 
cant differences obtained and the small num- 
ber of Ss, extreme caution should be used in 
interpretation of further analysis. Whenever 
possible, each S will be considered sepa- 
rately. 

Determination of the limen and standard 
deviation of gap acceptance was performed 
for each S by the normal graphic process 
(see Table 1). The limen in the present 
experiment is the point at which S would 
theoretically accept a given gap on 50% of 
the trials. The obtained limen was the same 
as the approach time each man would accept 


TABLE 1 


Tore Loren as A FUNCTION oF APPROACH 
Venicte VeELocity 


Velocity of approach vehicles 


Subject |————__ —— 
16 32 48 64 

Si | 

Limen 3.05" 2.97 4.0 3.5 

SD 32 32 77 | 54 
S: 

Limen 7.15 5.33 6.30 5.28 

SD 63 32 83 31 
Ss 

Limen 7.67 5.33 4.89 4.18 

SD 87 33 45 33 
Se 

Limen 4.85 6.75 6.69 8.05 

SD 98 1.67 75 9¹ 
Mean limen 5.68 5.10 5.47 5.25 
SD 70 60 70 52 

TEA a 


50% of the time. Although there was marked 
individual variation in threshold, individual 
slopes were similar, and the mean approach 
time accepted was remarkably constant over 
the speed range. 

Using the time gap obtained for each 
man, the gap distances which he accepted 
were calculated. These data, presented in 
Table 2, were corrected to reflect the precise 
velocity of the stimulus vehicle. 

To permit comparison of the four cur- 
rently accepted theories of gap acceptance, 
a plot was made of the curves derived from 
previously derived empirical and theoretical 


TABLE 2 


DISTANCE LIMEN AS A FUNCTION OF ÅPPROACH- 
VEHICLE VELOCITY 


Velocity of approach vehicles 


Subject 
16 32 48 64 

Sı 73.11 | 141.55 | 289.64 | 336.56 

Ss 171.38 | 254.03 | 456.18 | 507.72 

Ss 183.85 | 254.03 | 354.08 | 401.95 

Sa 116.25 | 321.71 | 484.42 | 774.09 
M 136.15 | 243.07 | 396.08 | 504.84 
SE 25.63 37.33 45.19 96.36 


1777!!! . —ù 
In miles per hour. 
b Ini feet. 
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Fic. 1. Gap accepted as a function of approach-vehicle velocity. 


values obtained from these theories (Fig- 
ure 1). 

The plot of the gap accepted versus the 
velocity of the approach vehicle permits 
graphic comparison of expected results 
which would be predicted by each theory. 
Data from the present experiment (Table 2) 
are included with the range of obtained 
values indicated by vertical lines and the 
crossbar indicating the standard error. These 
data produce a function which fit the equa- 
tion G = 5.4 V, where V = vehicle velocity 
in feet per second. 


DISCUSSION 


The approximately 5-sec. median gap size 
accepted by drivers in the present study 
agrees with Raff (1950) and Bissell (1960), 
and is only slightly shorter than that found 
by Robinson (1951) and Wagner (1966). 
In general, lower speed ranges were utilized 
in previous studies, but the congruence of 
the data, within similar speed ranges, in- 
dicates that the experimental controls im- 
posed in the present experiment did not 


bias the results. The congruence of the data 
with most median accepted gaps in the 8-6 
sec. range provides a strong indication that 
the time hypothesis is the most probable 
basis for driver gap acceptance decision. 

Graphical analysis of the data also tends 
to support the time hypothesis as the basis 
for driver gap acceptance. Analysis of the 
data indicates the time hypothesis most 
closely approximates the correct description 
of the actual gap acceptance process. The 
results obtained in the present experiment 
strongly disagree with the modified-time 
hypothesis. Although at the two lower speeds 
it would appear that any of the hypotheses 
could be supported, the overall trend of the 
obtained function refutes these three hy- 
potheses. 

The constant necessary to approximate the 
fit between the time hypothesis and the ob- 
tained data requires some discussion. The 
marked change in gap distance required 
by Ss is considered to be a function of (a) 
perceptual processes and (5) cognitive fac- 
tors. Perceptual factors include such factors 
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as: image size, vehicular image oscillations, 
and velocity estimation. The cognitive fac- 
tors include: S set, and risk-taking char- 
acteristics. 

Image size is considered to give very 
important cues to distance judgments in a 
static environment. In the present experi- 
ment the short (1.0 sec.) information 
acquisition time (IAT) emphasized any 
tendency of the Ss to make judgments of 
the approaching vehicle as if they were 
making judgments of a stationary object. 
A large proportion of the gap acceptance 
judgment is considered to be based on the 
correctness of judgment of image size. Brain 
(1962) considered that Ss rely on visual 
image size based on past experience rather 
than calculus for estimating minimum time 
intervals. Thus the factors of image size and 
prior knowledge which form the basis for 
image size judgments combine to produce 
one component of the gap acceptance judg- 
ment. 

Vehicular image oscillations contribute 
some cues to the judgment. Brain (1962) 
stated that, “there was a conscious effort to 
allow for the speed of faster vehicles from 
the rate of apparent image growth (over the 
long viewing time he used) and from the 
amount of yaw, roll, and general visual re- 
sponse to carriageway irregularities . . . 
[p. 791].” Thus it is possible that Ss in 
the present experiment did perceive slight 
vehicular oscillations during the short IAT 
and hence could make some judgments of 
relative approach speed. Using the oscilla- 
tion cue as a modifier of the information 
from the image size/distance cue, S made 
a gap acceptance, or rejection, judgment. 

The perceptual factors are further modi- 
fied by cognitive factors, such as the set of 
the S. The present experiment required Ss 
to respond “normally,” whereas most other 
studies have requested S to judge “the last 
moment for crossing” (Brain, 1962; Moore, 
1956). If Ss were following the instructions, 
then the magnitude difference between the 
obtained data and previous data could be 
reconciled. However, the slopes obtained by 
previous investigators and the slope of the 
present data do not agree. Therefore some 
alternative, or additional, process may be 
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operating in the different gap acceptance re- 
search situations. The results obtained were 
considered to be influenced by Ss acting ac- 
cording to a “normal” set that resulted in 
a more accurate estimation of gap acceptance 
behavior than previously available. 

Risk-taking characteristics of Ss ap- 
parently differ significantly as indicated by 
their performances in Table 1. Note that 5; 
accepted gaps of very similar size at all 
speeds, while there was considerable differ- 
ence in the gap accepted by the other 
drivers. The S, was the youngest of the Ss, 
a bachelor, and inclined to take risks (per- 
sonal knowledge). The other Ss were older, 
married, and not inclined to take risks (from 
interview statements). With present data a 
specific statement cannot be made on risk 
taking. However, future research is planned 
to provide information on this aspect of gap 
acceptance. 
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INDIVIDUAL VERSUS GROUP PROBLEM SOLVING 
IN AN INDUSTRIAL SAMPLE 


JOHN P. CAMPBELL 
University of Mimaneta 


A comparison of the quality of solutions to N. R. F. Maker's Chaser of 
Work Procedure problem under 5 experimental conditions; (1) ledividesl 
solutions, (2) Individual solutions after hearing group dixwuion witheet 
criticism, and (3) group solution after dicunlon, using a sample of 0 294- 
and Srd-level managers from a large midwestern public utility. Motivations! 
(Bass's Orientation Inventory) and biographical correlates of individual and 
group problem-solving behavior were alo examined. Group diwusden did not 
improve individual problem-solving efforts; group solutions tended to be of 
significantly lower quality, There also tended to be a negative correlation be- 
tween solution quality and orientation toward maintaining group interaction. 


When compared to the efforts of individuals 
working by themselves, do committee meet- 
ings and group discussions facilitate finding 
higher quality solutions to organizational 
problems? As pointed out by Gerard and 
Miller (1967) and Hoffman (1965), this has 
been a viable issue for a long time but con- 
siderable previous research on the topic has 
yielded conflicting results. Much of the earlier 
work tended to show that groups were better 
problem solvers than individuals (Lorge, Fox, 
Davitz, & Brenner, 1958), while more recent 
studies, such as those by Tuckman and Lorge 
(1962), Taylor, Berry, and Block (1958), 
and Dunnette, Campbell, and Jaastad (1963), 
have shown that under certain conditions a 
group may actually inhibit the best effort of 
its members. However, another recent study 
(Hall, Mouton, & Blake, 1963) tends to 
favor the superiority of the group solution. 

The difficulties involved in interpreting 
much of the past research have been cited by 
Lorge et al. who point out that such investiga- 
tions have frequently been carried out on 
ad hoc groups (in which the group members 

ve not previously met and probably never 
will again) of students, and the experimental 
problems are often of the puzzle or anagram 
type. Another difficulty lies in the definition 
of what is an individual. Should the average 
group score be compared to the average in- 
dividual score or to some index derived from 
the “nominal” group (Faust, 1959)? In a 
Nominal group, individual efforts are com- 
bined in some fashion by an independent 


party, for example, by choosing the beat in- 
dividual solution in 


addition, an effort is made to relate the differ- 
ential performance of an individual (isolation 
versus group discussion) to certain measures 
of individual differences. Attempts to find out 
what kinds of individuals are benefited or in- 
hibited most by the group experience are 
totally lacking in the literature. 


METHOD 
Subjects 
The Ss were 80 second- and third-line managers 
from a large midwestern public utility employing 
about 5,000 people. Most of the participants knew 
one another fairly well and many had previously 
been in meetings together, Thus, the groups used 
in the present study at least approach what Lorge 
et al. call a traditional group. 


The Experimental Problem 


problem used was a slight modification of the 
Change of Work Problem developed by Maier and 
described in Maier, Solem, and Maier (1957). In 
its present form it reads as follows: 
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to the alternatives roughly corresponding to Maier's 


ceived a score of O. The alternative with the highest 
possible score entailed: (a) ranking jobs for each 
person in terms of how well he can perform each 
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one; (Ò) having each person rotate among the 
three jobs such that he spends the most time em 
his best job, ete; (c) carefully discuming the sd- 
vantages and disadvantages of the new method ia 
a group meeting with full opportunity for the em- 
ployees to alr their fears; (d) providing for the 
experimental tryout of various rotation schedules 
so that the employees and foreman will be better 
equipped to choose the best one; and (e) dhcusing 
the possibility of setting some production goals te 
shoot for during the transition period. Some rep- 
resentative alternatives between these two et- 
tremes are: rotating between the two most suitable 
jobs and spending more time on the more suitable 
(35 points); equal time rotation between the twe 
best jobs but for periods longer than an hour (20 
points); accepting management's new method with. 
out benefit of experimental tryout or group dis 
cussion of the pros and cons (5 points). At all the 
steps along the scale, bonus points are given for 
adopting an experimental outlook with regard te 
the various elements in the solution, providing for 
maximum two-way communication between manage- 
ment and the employees, and for attempting to st 
transitional production goals in consultation with 
the employees. 


Other Measures 


All Ss in the study filled out an extensive 60-item 
personal-history questionnaire dealing with family 
and educational background, leisure-time activities, 
work history, participation in community affairs, 
etc, and also the Orientation Inventory (Bass, 
1962) which yields three subscores: (1) self-orienta- 
tion—which reflects the extent to which a person 
describes himself as expecting direct rewards to 
himself regardless of the job he is doing or the way 
he affects others; (2) interaction orientation 
which reflects an individual's degree of concern with 
maintaining harmonious relationships in a supet- 
ficial sort of way at the expense of concern for the 
task; (3) task orientation—which reflects the de- 
gree of concern with completing the job, solving 
problems, working persistently, and doing the best 
job one can. The test consists of 27 items all of 
which are keyed for each of the three scales. 


Procedure 


Forty-eight of the Ss were randomly assigned t9 
the experimental group and 32 to the control group- 
Initially all Ss were mailed the biographical question- 
naire and the personality measure. In the same 
mailing the experimental group was given 
Change of Work Problem and asked to write what 
they considered to be the best solution, in as mu 
detail as possible. The control group worked on 
the Circular Assembly Problem (Maier et al., 1987) 
which is also a combination production-process~ 
human-relations type problem. Materials were origi- 
nally mailed to 82 Ss, and 81 were returned. 

The Ss were then assembled into 4-man groups, 
12 experimental groups and 8 controls. All grouPs 


each of the four solutions. The 
same 
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Thus, for the experimental group three 
solutions were offered to the problem: (1) 
dividual solution while working alae, (2 
fedividual solution after bearing and 
fe a group discumion, and (3) a 
The control group offered solutions 
of Work Problem only uncer the last 

Assuming that the comparkon of 
@xperimentals docs not show 
efect for the experimental group, & 
different analyses are powible. Using 
as degrees of freedom, comparisons can be 
see if group discussion increases oF 
quality of individual solutions to 
Using real and nominal groups as 

comparisons can be made 
experimental conditions. For the 
{independent combinations of individual 
two different group scores were used, 
individual score in the group and a 
Obtained by putting together 
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Scores were correlated for the 48 individual 
to the Change of Work Problem and 
the 48 individual-after-group-discussion soht- 
Both scorers also derived and scored com- 
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were obtained by reconciling any differ- 
between Scorer 1 and Scorer 2 to their mutual 
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TABLE 2 


Compasisons or Max Grobr Scores CLASSIFIED BY 


EXPERIMENTAL CONDITION 


AND py INTERACTION ORIENTATION SCORES 


Orientation test scores Mailed | z After Group Combined 
responses discussion consensus 
Composite score used as nominal group score 
High interaction orientation score 15.2 | 10.2 2.5 93 
Low interaction orientation score 23.5 19.0 5.2 15.9 
Combined 19.4 | 14.6 3.9 
Average score used as nominal group score 
= - —— 
High interaction orientation score 8.0 5.2 2.5 5.2 
Low interaction orientation score 8.2 9.2 5.2 7.5 
Combined 8.1 7.2 3.9 


score and the average of individual scores 
within the group. 

The superiority of the nominal group was 
significant and large, regardless of which type 
of score was used to reflect the nominal 
group’s performance. However, none of the 
main effects pertaining to the orientation 
variables and none of the interactions of these 
variables with the experimental treatments 
was statistically significant. Since the inter- 
action orientation score was the only dimen- 
sion to approach significance, only the analy- 
sis of variance for Experimental Treatment 
x Interaction is shown in Table 1. The cor- 
responding means are shown in Table 2. 
The means and sums of squares for the treat- 
ments would of course be the same no matter 
which orientation score was used. 

Correlations between the solution scores 
obtained under particular experimental treat- 
ments and the orientation data were also ex- 
amined. For the individual scores the rela- 
tionships were computed using the Pearson r, 
and for the analysis of group scores Spear- 
man’s rank-order correlation was used. In- 
dividual orientation scores were correlated 
with individual solutions obtained before dis- 
cussion and after discussion, and with the 
difference. The rankings of average group 
orientation scores were correlated with the 
ranking for nominal group composite scores 
obtained before discussion, the ranking for 
the group consensus scores, and the difference 


between these two rankings. For example, 
a positive correlation between an orientation 
score and a difference score means that the 
higher an individual or group stands on the 
orientation variable the greater the decre- 
ment in solution quality under the group 
discussion or group consensus condition. 

The analyses using individuals as observa- 
tions yielded no correlations significant at 
p= .05; however, two of the correlations 
using group rankings were statistically sig- 
nificant and several others approached sig- 
nificance. The rank-order correlations for the 
group data are shown in Table 3. Composite 


TABLE 3 


RANK-ORDER CORRELATIONS OF AVERAGE ORIENTATION 
Scores WITH NOMINAL Group COMPOSITE SCORES, 
THE ReaL Group SOLUTION, AND THE DIFFER- 
ENCE BETWEEN THE COMPOSITE SCORE 
AND THE REAL GROUP SCORE 


Solution quality scores 
Score Nominal | Real 
group group Difference 
composite | consensus 
Self-orientation 5" 45 37 
Interaction 
orientation — .68** —.37 —.46 
Task orientation 27 07 2⁵ 
Note. V = 12, 
7 05. 
** = 01. 


¢ 
Orientation scores and low interaction orienta- 
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scores for nominal groups tended to be higher 
for those groups with high-average self- 


tion scores, However, groups with high- 
average interaction orientation scores tended 
to show less of a decrement in problem- 
solution quality when they were asked to 
reach a group consensus. Somewhat sur- 
prisingly, the task orientation variable 
yielded the lowest relationships. 

No consistent relationships were found 
with any of the biographical items, 


Discusston AND CONCLUSIONS 


The results of the present study tend to 
Support the more recent research on individ- 
wal versus group problem solving cited 
earlier. Group participation and discussion 
did not generate higher scores but rather 
tended to be inhibitory. The quality of the 
group solution was inferior to the nominal 
group’s composite score and was even inferior 
to the average individual solution. Tt should 
be pointed out that such a finding is de- 
pendent on the type of problem used and the 
manner in which it was scored. Also, nothing 
has been said about the properties of the 
group situation which might have led to the 
reduction in quality scores. Perhaps new and 
better alternatives were stimulated but were 
just never expressed. 

Groups seemed to settle very early on a 
particular line of attack, and alternative work 
procedures were seldom explored in the group 
discussion. However, this did not seem to 
Occur in an atmosphere of obvious inhibition 
and threat. Most Ss knew one another on a 
first name basis and all seemed highly mo- 
tivated and enthusiastic. Also, it is doubtful 
that time limitations could explain the in- 
feriority of the group solution. A 24-hr. period 


was set aside for the experimental session 


and no time limitations were imposed. The 
absence of such a limitation was reinforced 
by the experimenter. The experimental ses- 
sions were relaxed but task oriented, and none 


of the Ss expressed any discomfort because 
of being away from his job too long. The 


_ degree of S cooperation and involvement was 
ideal, 


The results obtained with the Orientation 
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In the latter case, the effects 
quality may not be as important as the mo- 
tivational aspects of group participation. 

In addition, the present study and 


niques yields solutions which result in the real 
group being superior to the nominal group. 
The relative utility of various problem-solving 
strategies for different types of problems is an 
area which remains unexplored. 
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ENVIRONMENTAL FRUSTRATION AND CREATIVE 


PROBLEM SOLVING 


BERNARD L. HINTON? 
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Report of an empirical study of the effect of environmental frustration on 
creative-problem-solving performance. The research design is based upon a 
linear model of creative problem solving, Alternate forms of analysis are 
applied, testing the relationship between environmental frustration and crea- 
tivity under various sets of assumptions about the experimental conditions. 
It is concluded that environmental frustration does significantly reduce crea- 
tive-problem-solving performance. This is supported under a number of 
alternative assumptions about the “rest of the world” variables of personality 


and ability factors, 


Recent research on creative problem solv- 
ing has tended to focus primarily on the 
factors of the creative person (e.g., Barron, 
1953, 1954, 1955, 1957; Dentler & Mackler, 
1964; Drevdahl, 1956; Garwood, 1964; Guil- 
ford, 1959; Hitt & Stock, 1965), the creative 
process (e.g., Crutchfield, 1961; Ghiselin, 
1956; Harmon, 1956; Taylor, 1959), or the 
measurement of creativity through the evalua- 
tion of the creative product (e.g., Ghiselin, 
1958; McPherson, 1956; Taylor, Smith, & 
Ghiselin, 1959). The research which has 
been reported on investigation of the creative 
environment has been almost exclusively re- 
stricted to studies of organizational climate 
or group climate (e.g., Crutchfield, 1962; 
Fiedler, 1962; Gibb, 1951; Ziller, Behringer, 
& Goodchilds, 1962), with considerably less 
interest being shown in the micro- and psy- 
chological climate as it pertains to the in- 
dividual problem solver. p 

Maddi (1964), on the basis of his experi- 
ence, has labeled some generally held assump- 
tions about the effects of psychological stress 
as being “old wives’ tales.” One of these as- 
sumptions concerns the reduction of creativity 
in frustrating environments. Maddi argues, 
in part, that a creative person can ignore, Or 
at least overcome, such restriction and offers 


1 Now at Indiana University, School of Business. 
This article is derived, in part, from a doctoral 
dissertation completed at Stanford University. The 
author wishes to express appreciation to Charles 
W. Haley, now at University of Washington, for 
his assistance on the model development and to 
Thomas W. Harrell for his many contributions to 
the research effort. 


case examples to support his position. The 
research reported herein was designed to 
test the validity of that position; the null 
hypothesis is: Creative-problem-solving per- 
formance will not be influenced by a state 
of frustration experienced by the problem 
solver. 


PROCEDURE 


A battery of criterion measurement instruments 
was administered, comprised of an Alternate Uses 
test, a Match Problems test, and a Consequences 
test (scored for both “obvious” and “remote” re- 
sponses). Certain modifications to the published 
form of these tests were found to be necessary due 
to the pre-post-test research design. Primarily these 
required the partitioning of the instruments into two 
comparable forms. In the case of the Match Problems 
and the Consequences tests this merely required a 
physical division of the test booklets. The reliability 
of the Match Problems test was judged to suffer 
a negligible amount by such action. The reliability 
of the Consequences test had already been demon- 
strated to be unaffected, as the manual reported as 
few as eight items could be used without affecting 
the reliability, and after generation of alternate 
forms there remained 10 items in each form. 

In the case of the Alternate Uses test, however, 
a split into halves was judged to have a serious 
effect on reliability and thus a parallel set of test 
questions was deemed necessary. This generated set 
was balanced, pair-wise, against the original set 
according to several dimensions, including the nature 
of the objects listed in the test, the relative size of 
the objects, and the estimated cost of the objects. 
A sample pretesting of the response frequency of 
this set against the original set indicated com- 
parability. To further reduce the introduction of 
bias or error, the two sets were then merged, with 
each set having half of its items from the original 
set and half from the generated set. Additionally, 
the half from the generated set were the paired items 
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to those now placed in the alternate form, A random- 
sample item analysis on response frequency after 
the original collection of data indicated that the 
original and generated sets of items were com- 
parable in that no significant differences occurred 
in the mean number of responses given to the 
generated items and those given to the original items. 

The initial testing was carried out at the beginning 
of the academic year. The research Ss were entering 
Master of Business Administration (MBA) candi- 
dates in the Graduate School of Business at Stan- 
ford University. The data were collected prior to 
registration and in conjunction with the collec- 
tion of data for other on-going research. The struc- 
ture of this situation was such as to tap a situation 
where the level of frustration was probably as low 
and as randomly distributed as it is possible to 
achieve in students in the real world. At the least, 
the environment contained no identifiable sources 
of frustration. What frustration may have been 
present, from whatever source, does of course serve 
to increase the random-error term in the analysis 
presented later. 

Parallel test forms were used for the second 
administration, which was carried out about 6 wk. 
later, well into the fall quarter. Prior observation of 
MBA students had indicated that this point in 
time was for many of them a peak in their level 
of frustration as they underwent an adjustment to 
the high level of output required of them by the 
academic program. Additionally, the tests were ad- 
ministered immediately after the return of a major 
paper, a task on which many students had failed to 
achieve their expected performance level. 

Also administered were two sheets of graphic 
rating scales which were designed to measure the 
emotional state and attitudes of the testees. In- 
cluded were distractor questions intended to help 
mask the prime interest area. The second of these 
sheets presented eight scale measures on which the 
S indicated what his emotional state had been 
while taking the tests. The S’s response on the 
frustration scale was of primary interest. 

Scoring of the test battery was done after all 
of the data had been collected, but was independent 
of S’s responses on the graphic scales, (It was 
“blind” in that the quantification of the scale 
responses was the last step in the scoring procedure.) 
This was deemed necessary in that a fair degree 
of scoring subjectivity was implicit on both the 
Alternate Uses and Consequences tests. The Match 
Problems test was scored objectively. The scores 
were recorded separately on each factor, as a 
joint score on the Consequences test, and as a 
total (additive) score. Analysis was carried out on 
each of these six sets of dependent variable measures. 

The administration, in both instances, was carried 
out in large groups and in a classroom situation. 
The intent was to hold as constant as possible the 
physical setting (thus avoiding the introduction of 
extraneous stimuli or cues) and the sociological 
setting. The Ss were not informed of the nature 
of the criterion (creativity) until after the testing 
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had been completed, although the nature of the 
tests may have given some indication of this 

After the collection of the data had been ac 
complished, as described above, the raw scores on 
the various creativity measures were transformed 
into standard scores (T scores) with a mean of SO 
and a standard deviation of 10. This procedure 
rendered the scores on the alternate forms of the 
criterion instruments used directly comparable. The 
initial sample size was 226, but after the elimination 
of those Ss for whom incomplete data were received, 
and all of those whose first language was other 
than English, the final sample size was 147, 
Data were incomplete on the criterion test battery 
in only three cases. The remaining Ss eliminated 
because of incomplete data either failed to complete 
the graphic scales or gave identical responses to each 
scale. 


Tue MODEL 


This research project was designed through the 
use of a model of creative problem solving (Hinton, 
1966) which has been suggested as offering a basis 
for the comparison and design of creativity re- 
search. The proposed model hypothesized a simple 
linear relationship between the variables of person, 
environment, process, and product. Process is con- 
sidered, however, to be an intervening variable, and 
is not directly expressed in the model. The model's 
initial expression is thus: 


Cin d Air S Pin + 
y Eije 9 S +A Sti He [1] 


where C represents an index of creative-problem- 
solving performance, A an index of given ability or 
intellect factors, P an index of personality factors, 
E the psychological or emotional state, S* the 
sociological setting, and S** the physical setting. 
In each case the subscript i identifies an individual, 
the subscript j the specific variable, and & the point 
in time. 

Expressed verbally, creativity is a function of the 
person and the environment where the person 18 
represented by measures of his general level of 
ability and measures of his personality makeup 
(although these are not necessarily independent), 
and the environment is comprised of psychological, 
sociological, and physical elements. 

For the current research effort Ci is defined as 
the performance or score on the research instruments 
or tests of creative-problem-solving performance. 
Ear is defined as the measured value taken from the 
eight graphic rating scales on which S indicated 
his emotional state while completing the test battery. 
There are, therefore, a total of six j’s for Cann and 
eight Js for Eix. Within the time span and structure 
of this research effort the values of 4% Pum Sie 
and Sr, are unknown but assumed constant. That 
is, the change A 4% A Piz, A S*, and 4 Sty 
from the first administration to the second ad- 
ministration is assumed to be essentially zero. 
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4 Covariance 1 
Variate = Cn Covariate = Cys; 
Treatment Groups allocated by 
Emotional State Measure 
ACY = yE + Simple correlation: CH for By = Aw * Agi Pin ~ Pot Fin wadom 
i (Frustration) and for Ey = (Secale | with repect to Ben 
. factor) l 
Gin = 8 Cy. + Multiple cofrelation : Ag = Ani Pa = Pan 
IKT CoC Ras 
— — im N 9 


Analysis applied is not necessarily tert of model 


ng coefficient for factor in model 


Table 1 shows the model statements which may 
us be derived? and the analysis to be applied. 
It should perhaps be noted that the tests applied 
3 not specifically testing the model statements. 
That is, the author was interested in the strength 
or predictive power of the relationship rather than 
the value of the coefficient. While the model does 
nply that it is possible to accurately quantify 
the relationship, it was in fact not realistic to do 
0, in view of the nature and design of the research 
ind in spite of the fact that such coefficients could 
tadily be calculated. 


RESULTS 


The emotional state scale measures (quanti- 
fied values) were analyzed, and the inter- 
correlations shown in Table 2 determined. 
ary interest is upon the frustration 


For the derivation of these statements the reader 
zs referred to Hinton (1966). 


The asalysis tests strength of reieteomsbep raider thew 


direct statement. 
statement, The results must be interpreted accordingly. 


were perfect and if all other influencing fac- 
tors were constant in their effect from Time 


TABLE 2 
CORRELATION MATRIX—EMoTIONAL STATE SCALE MEASURES 
A Frustra- | Discon- | Discour- | Inade- n 3 | Discom- | Dissatis- 
Eecling tion tentme e falco Aggression! fort | faction 
b Frustrati 172 60) 852 434 054 562 625 
dis * 1 1123 035 041 038 “310 192 
ouragement 1,000 695 517 005 Ae 617 
equa 1,000 459 004 492 474 
e 1 r a eet 
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TABLE 3 


CORRELATION COEFFICIENTS FOR FIRST AND SECOND 
ADMINISTRATION SCORES ON CREATIVITY 
CRITERION MEASURES 


Correlation coefficient 


| 

| 
Criteri . a: | * 
Criterion measure (Cr | C- 1/ Cu- 


(i = 1) Alternate uses 640 
(7 = 2) Match problems 4899 
(7 = 3) Consequences-obvious 6104 
(j = 4) Consequences remote 4133 
(j = 5) Consequences-total 6885 
(j = 6) Total score 7147 


1 to Time 2. In view of the reliability esti- 
mates (from manual, with range from . 70 
to .86) and the assumptions concerning 
variable changes over the time period con- 
cerned, the correlations shown do indicate 
that a change in the environment (the only 
variable in the model assumed to have 
changed) had some effect on creativity. In 
order to test the significance of this, and to 
relate it directly to frustration, a more direct 
test of the null hypothesis was carried out 
using covariance analysis. 

The Ss were divided into three groups 
according to their response to the scale 
Frustration on the Emotional State Scale 
Measure. This assignment was done accord- 
ing to two schemes, the first assigning cases 
to groups according to an equal-interval sub- 
division of the frustration-scale responses, 
the second according to a ranking of the 
frustration-scale responses, and then a split 
into three groups of approximately equal 
size. Ranking ties were not split, thus a 
minor inequality in group size. These assign- 
ments are shown in Table 4. 


TABLE 4 


CASE ASSIGNMENT FOR COVARIANCE ANALYSIS 


F scale value Group assigned} No. cases 
Eija = 0.00 to 5.00 1 20 
Liye = 5.01 to 9.99 2 75 
Hijz = 10.00 to 15.00 3 52 
Eija = 0.00 to 6.20 4 51 
Ea = 6.21 to 9.99 5 44 
Bij: = 10.00 to 15.00 6 52 
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An analysis of covariance was then carried 
out on these treatment groups“ using the 
Che or creativity criterion test score on the 
second administration as the variate and the 
Cat or creativity test score on the first ad- 
ministration as the covariate. This was done 
for each of the criterion measures. The results 
of this analysis are shown in Table 5. 

Accordingly, since the analysis shows the 
differences to be significant, and the only 
variable presumed to have changed is the 
emotional state, the null hypothesis is re- 
jected for each of the criterion measures ex- 
cept the Consequences—Total score. That is, 
the conclusion is drawn that the emotional 
state, or more specifically the state of frustra- 
tion, did have an effect on creative-problem- 
solving performance. 

The next test applied evaluates the strength 
of the relationship between the change in 
the creativity criterion test score (A Ci) and 
Ej, the emotional state at the time of the 
second administration, for Ey = Frustration 
and for Eh = Frustration + Discomfort + 
Dissatisfaction. 

Summary statistics for this analysis are 
shown in Table 6. This table gives both the 
correlation coefficient and the 7? (coefficient 
of determination) for each of the measures. 
These results are significant at the .05 level 
or better for all criteria except the Con- 
sequences—Obvious. 

The next test to be carried out was that of 
the multiple relationship of Ci with Cj, and 
Eye, that is, the creative-problem-solving 
score on the second administration as ex- 


TABLE 5 


COVARIANCE ANALYSIS F RATIOS FOR 
CREATIVITY CRITERIA 


Criterion F ratio for |F ratio for 
Groups | Groups 
(Civ 1273 4,5,6 
— 
(7 = 1) Alternate uses 9.099** 9,996** 
(j = 2) Match problems 6.193** | 5.348" 
(j = 3) Consequences-obvious | 3.083“ 3.683" 
(j = 4) Consequences-remote | 37.414** | 43.049" 
(j = 5) Consequences-total 2.986 2.860 
(j = 6) Total score 14.247** | 11,511 
* <.05; F = 3,07 or more. 
** p <.01; F = 4.79 or more. 
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TABLE 6 
CORRELATION COEFFICIENTS FOR Emorion at Starz Measures axp CHANGE ix 


Creativity Cerrerion Measure Scores 


it Ey: = Frustration Ein = Aale variable 
Criterion = 
(AC) | 5 =F lie 0 
7 r r r 
— — — — — — — 
7 = 1) Alternate uses —.3019"* | 0911 3028% 0917 
(j = 2) Match problems —.2599** | 0676 | 2124° O451 
(i = 3) Consequences—obvious 1748˙ 0306 0367 0013 
(i = 4) Consequences-remote — 5094" 3242 | 3563** 1269 
(j = 5) Consequences-total —.19339¢ 0374 l —.2588°* 0670 
(j = 6) Total score — .3690"* 1362 ~ 3631** 1318 
I 


*p <.05. 
> < O01. 


plained by the parallel score on the first 
administration and the effects of the state of 
frustration indicated by the emotional state 
measure. This should theoretically give com- 
parable results to that presented in Table 6 
except that, whereas the analysis presented 
in Table 6 assumed the effect of Ein, the 
emotional state of the S at the first ad- 
ministration, to be randomly distributed with 
respect to the emotional state at the second 
administration (E2), no such assumption 
is required for this analysis. Any effect the 
emotional state had upon creativity for the 
first administration is now adjusted or ac- 
counted for in the term Cin. The summary 
statistics for this analysis are shown in 
Table 7. 

To the degree that the results presented 
in Tables 6 and 7 differ, such differences may 
be ascribed to the presence of some degree 
of frustration at the time of the first ad- 
ministration, some unknown measurement 


error, and the predictive contribution of Cn, 
which has already been seen to be from .41 to 
.71 (Table 3). For direct comparison, the data 
in Tables 6 and 7 need to be corrected for 
this. The results in both analyses do indicate 
that the state of frustration evidenced by S 
contributes a significant effect to that S’s 
creative-problem-solving performance. 

In summary, then, the data analysis has 
indicated that the emotional state of an in- 
dividual, and more particularly the state of 
frustration, does significantly influence his 
creative-problem-solving performance. This 
finding is supported by both the covariance 
analysis presented and by the multiple-regres- 
sion analysis presented. 


Discussion 


The objective of this research effort has 
been an investigation of the effects of an 
individual’s emotional state, specifically his 


TABLE 7 


MULTIPLE CORRELATION COEFFICIENTS—CREATIVITY CRITERION MEASURE SCORES FOR First AND SECOND 
ADMINISTRATIONS AND EMOTIONAL STATE MEASURES 


| Criterion (Cj;1), Eye = 3-scale variable 


Criterion (Ci), Esje = Frustration 
Criterion es R R | R 

(7 = 1) Alternate uses 7104 3025 D ne 
(j = 2) Match problems 512 Ene 6114 3738 
( Jj = 3) Consequences-obvious 6303 52 4 5618 3156 
(G = 4) Consequences-remote -1017 5008 7236 5236 
Q = 5) Consequences-total -1077 : 7707 5939 
(j = 6) Total score -1725 — 5 
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state of frustration, on creative-problem-solv- 
ing performance. 

Used as criterion measures have been three 
tests which tap four separate factors of intel- 
lectual ability believed to be basic to crea- 
tivity. Two additional criterion measures have 
also been applied, one being the total sum 
of the scores on the four factors, the other 
being the sum of two separate scores derived 
from the Consequences test. Thus a total of 
six criterion measures have been used. These 
measures, while only approximations to gen- 
eral creative abilities and not universally ac- 
cepted, do possess construct validity and have 
been fairly widely utilized. 

Throughout the Results section many of 
the analyses presented show a very similar 
pattern of level of significance across these 
six criterion measures. This pattern is not 
due to the reliability of the various measures, 
however, since it does not correspond to the 
reliability pattern as derived from the test 
manuals. At least part of the pattern ap- 
pears to be explained by the relationship be- 
tween scores. The three measures affected are 
the Consequences—Obvious, Consequences—Re- 
mote, and Consequences—Total. 

The Consequences test, it should be remem- 
bered, yields two separate factor scores. Also 
utilized as a criterion score was the sum of 
these two scores, the Consequences—Total 
score, reflecting the total number of re- 
sponses given to the test. As shown in Table 
6, the correlation coefficients between the 
creativity criterion measures and the frustra- 
tion measure are all negative except for the 
coefficient for the Consequences—Obvious 
measure. This indicates that, in general, the 
presence of frustration resulted in a de- 
crease in creative-problem-solving perform- 
ance. In the case of the Consequences—Obvi- 
ous measure, however, the correlation coeffi- 
cient is seen to be positive. The Conse 
quences-Total score, being the arithmetic sum 
of the Consequences-Obvious and Conse- 
quences-Remote measures, can thus be seen 
to have suffered from a canceling effect. 

That is, performance under a state of 
frustration relative to performance under the 
initial and presumed nonfrustrated state is 
being measured by these nonindependent 
scores. The total number of responses given 
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to the Consequences test tended to go down 
under frustration. At the same time, how- 
ever, the ratio of obvious to remote responses 
shifted even more drastically, so that the end 
result was a major decrease in remote re- 
sponses, a moderate decrease in total re- 
sponses, but an increase in obvious responses. 
In a sense the quality of the responses, as 
well as the quantity, was affected, thus the 
shift in the sign of the correlation coefficient, 
and the canceling effect in terms of the tests 
of significance on the summed variables. It 
should be noted that this applies to all of the 
analyses based on these scores, although the 
magnitude of the effect varies. 

The Consequences-Remote criterion meas- 
ure appears especially strong. This is en- 
couraging in view of the fact that this score 
measures the factor DMT (Divergent Se- 
mantic Transformations), defined as Origi- 
nality (Guilford & Hoepfner, 1963). This is 
perhaps the one single factor which most 
closely approximates the usual, but larger, 
concept of creativity. 

In view of the findings, then, strong sup- 
port is believed present for the rejection of 
the null hypothesis, and the conclusion that 
frustration does reduce problem-solving per- 
formance. The correlation between the change 
in creativity criterion scores under frustra- 
tion and the emotional state measures evi- 
dences res up to .324 in the case of the 
Consequences-Remote score. While the results 
in terms of the other criterion measures do 
not approach this level, it should perhaps be 
pointed out that this particular analysis is 
not, by its nature, extremely sensitive since 
it is not corrected in any way for the pos- 
sibility of frustration having existed at the 
time of the first administration, or the in- 
fluence of the personality variables, or any 
of the many other factors which may exert 
an influence. 4 

Analysis of the second administration 
scores as explained by the first administra- 
tion scores and the emotional state yielded 
R’s (multiple coefficients of determination 
ranging from .304 to .597, with the Total 
score criterion giving the highest coefficient. 
This indicates some stability to the effect of 
frustration—that is, it implies that the effect 
is of similar nature for each of the criterion 
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measures even though the size of the effect 
may be different. 

In hindsight it is obvious that a more 
optimal procedure than that utilized would 
have been to collect data on the level of 
frustration present in Ss at this initial point; 
this was not in fact done. Such collection 
would have freed one from the necessity of 
making any assumptions about the level and 
distribution of the emotional factors, and 
statistical corrections could have been made 
to the posttest data to adjust for any vari- 
ance found. 
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Dyad negotiation teams (homogeneous in the complexity of the individuals’ 
conceptual structure) spent several hours attempting to solve a simulated in- 
ternational conflict situation. Every participant repeatedly rated himself and 
the other participants on Stogdill’s 10 leadership characteristics. The dyad 
member with the overall highest ratings on all scales was considered the dyad's 
leader. The effect of simplicity and complexity of conceptual structure on the 
degree to which leaders display Stogdill’s leadership characteristics was analyzed 
in an ANOVA design. The interaction effect for complexity and leadership 
characteristics was highly significant. Complex leaders were rated higher on 
tolerance for uncertainty, assumption of leadership role, consideration, and 
predictive accuracy. Simple leaders were rated higher on initiation of structure, 
production emphasis, and demands reconciliation. 


In a series of statements, Driver and Streu- 
fert (1965, 1966a, 1966b) and Schroder, 
Driver, and Streufert (1967) have been con- 
cerned with complexity theory. According to 
these authors, individuals differ in the com- 
plexity of their conceptual structure, that is, 
in the degree to which they use differentiation 
and integration in information processing. The 
theory suggests that “simple” persons will re- 
late incoming information (input) to decisions 
(behavioral output) in a 1:1 fashion. They 
should rarely generate alternative interpreta- 
tions of the same information (no differentia- 
tion) and they should not consider the con- 
sequences of the varying interpretations for 
the potential decisions they might make (no 
integration). “Complex” persons on the other 
hand should tend to generate many alternate 
interpretations of informational input and 
should relate these alternative interpretations 
in various ways before reaching a (often 
tentative) decision (output). Generally, then, 
complex persons are more open to changes in 
the environment, adapt more easily, but are 
less constant and more liable to try pragmatic 
trial and error behavior (even when this is 
unnecessary). Usually, complex persons may 
be described as more strategic.“ 


1This research was supported by the Office of 
Naval Research, Group Psychology Branch, 


Differences in complexity of conceptual 
structure are not related to differences in in- 
telligence. Correlations of complexity mea- 
sures and measures of intelligence have gen- 
erally been near zero. 

The theory has generated a considerable 
amount of research. This research has served 
to support a number of the theoretical prop- 
ositions. As predicted by Driver and Streufert 
(1965, 1966a, 1966b) and Schroder et al. 
(1967), individual or group differences in the 
complexity of conceptual structure account 
for some of the differences in attitude change 
(Streufert, 1965), perception of others’ in- 
tentions and strategies (Streufert & Driver, 
1965), decision-making characteristics (Streu- 
fert & Schroder, 1965), information orienta- 
tion, information search, and information 
utilization (Karlins & Lamm, 1967; Stager, 
1967; Streufert, Suedfeld, & Driver, 1965; 
Suedfeld & Streufert, 1966), and other areas 
concerned with social interaction (Crano & 
Schroder, 1967; Sieber & Lanzetta, 1964; 
Tuckman, 1964). 

Little interest has been devoted so far to 
an analysis of the effect of complexity of con 
ceptual structure on leadership characteristics. 
According to findings of Schroder, Streufert, 
and Weeden (1964), some relationships be- 
tween leadership and complexity do exist. In 
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a preliminary investigation of leadership, these 
authors found that groups of complex indi- 
viduals change their leaders more often than 
groups of simple individuals. 

According to the Driver and Streufert and 
the Schroder, Driver, and Streufert theory, 
one would not predict that either simple or 
complex persons would make more effective 
leaders. However, one would predict that cer- 
tain forms of leadership would be more likely 
displayed by complex persons, and other forms 
of leadership would be more likely displayed 
by simple persons. For instance, one would 
predict that complex leaders would be more 
willing to work with unknown factors, be 
more empathic, and predict group success 
better (when the determinants of group suc- 
cess are complex). On the other hand, one 
would predict that simple leaders would ex- 
ceed in emphasis on work success, giving di- 
rection (rather than empathy), and calling for 
unity of purpose. Theoretically, one might 
postulate that the degree to which complexity 
or simplicity in leaders may be an asset would 
depend on the particular requirements of the 
situation with which leader and follower 
group must cope. 

Concerned with the analysis of leadership 
characteristics, Stogdill (1948, 1957, 1962) 
proposed 10 characteristics which may be 
used to describe leaders: (1) tolerance of un- 
certainty, (2) persuasiveness, (3) initiation 
of structure, (4) tolerance of freedom of ac- 
tion, (5) role assumption, (6) production 
emphasis, (7) consideration, (8) predictive 
accuracy, (9) demands reconciliation, and 
(10) representation (Stogdill, 1962). These 
10 characteristics include some for which dif- 
ferences between complex and simple leaders 
might be expected (see above). Such differ- 
ences, however, would occur only if the en- 
vironment in which group and leader operate 
is sufficiently complex, so that differences can 
emerge (cf. Schroder et al., 1967). 

The present experiment is concerned with 
an analysis of the relationship of leadership 
characteristics as described by Stogdill's 10 
leadership scales to the complexity of con- 
Ceptual structure in leaders, as measured by 
the sentence completion test (SCT) of 
Schroder and Streufert (1963) and Schroder 
et al. (1967). 
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In its present form, the SCT employs the 

following stems: “When I am not certain 
which decision to make,” “When someone 
disagrees with me it usually means,” “When 
I am with two good friends and they express 
different points of view,” “When a situation 
can be interpreted in several ways.” The Ss 
are given 3 min, to complete each stem and 
write at least three more sentences on the 
subject. Responses are scored in terms of the 
multidimensionality and interrelationships of 
the response. For instance, a response scored 
as “simple” might be: 
When someone disagrees with me it usually means 
that he is wrong. I do not take issue when I am not 
sure of my ground. I try very hard to present the 
logic of my argument to the other person. If he will 
not listen I just give up. Some people never learn. 


Other simple responses state just as categori- 
cally that, “When someone disagrees with me 
it means that I have made another mis- 
take...” 

A complex response to the same stem might 
be: 


When someone disagrees with me it usually means 
that we have had different experiences. We see the 
situation in a different way, and a long discussion is 
likely to ensue. If we both listen carefully, we will 
probably find the meaning in the other point of 
view, and where it can be applied. Often we both 
change our views somewhat. 


A complex simulated environment, the 
Tactical and Negotiation Game (TNG; 
Streufert, Kliger, Castore, & Driver, 1967) 
was chosen for this research. Leadership is 
measured in a simulated government (inter- 
nation) negotiation setting, which, because of 
the constellation of demands and offers, holds 
considerable similarity to both governmental 
and industrial negotiation situations. In addi- 
tion, this simulation situation provides suf- 
ficient environmental complexity to permit in- 
dividual differences in complexity of con- 
ceptual structure to take effect. 


METHOD 


Subjects 


Three hundred and fifty male undergraduate stu- 
dents in an introductory psychology course at an 
eastern state university were tested for complexity 
of conceptual structure with the SCT (Schroder et 
al., 1967; Schroder & Streufert, 1963). The potential 
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score range of Ss on this test varies from score 1 
(very simple) to score 19 (very complex). The high- 
est score obtained in this sample was 17. The Ss 
were selected into simple (score 1 and 2) and com- 
plex (score 10 or above) categories. Forty Ss, 20 of 
simple conceptual structure and 20 of complex con- 
ceptual structure, were selected for this experiment. 
The Ss within each of these groups were randomly 
assigned to dyad negotiation teams, so that two per- 
sons on one team would negotiate with two mem- 
bers of the opposing team. Teams were homogeneous 
in conceptual structure and negotiated with teams 
of the same level of conceptual structure. Differences 
in intelligence (College Board scores) between groups 
were not obtained. 


Setting 


The environment used for the present experiment 
was the negotiation component of the TNG de- 
scribed by Streufert et al. (1967). The topic of 
negotiations was the attempt at a settlement of an 
international conflict situation (based on the work 
of Streufert, Clardy, Driver, Karlins, Schroder, & 
Suedfeld, 1965). The negotiations, in effect, are not 
unlike potential negotiations for a settlement of a 
Vietnam or Cuba type conflict. 

The Ss were given 14 hr. to study the participants’ 
manual for the TNG.2 The manual contains con- 
siderable background information on the negotiations, 
the demands and offers to be employed by the team 
members, and potential demands and offers expected 
from the opposing dyad team (see below). All back- 
ground information in the manual was phrased in 
terms intended to persuade Ss that they were ne- 
gotiating for the “good” side. The biased presentation 
of this material was found to be very effective. 

The Ss were presented with 8 offers and 10 de- 
mands, both phrased in maximum and minimum 
terms. They were told that they were expected (by 
their government) to get the opponent team to ac- 
cept as many demands as possible, and to make as 
few offers as possible. The Ss were informed that 
their pay for participation in the game would de- 
pend directly on the number of offers and demands 
made. For instance, Ss would receive $2.00 for each 
of their maximum demands and $1.50 for each of 
their minimum demands that was accepted by the 
enemy team. In turn, they would lose $1.00 for each 
maximum offer and $.75 for each minimum offer 
made to and accepted by the enemy team. In addi- 
tion, they would lose $1.00 to $1.25 (depending on 
whether maximum or minimum) for each enemy de- 
mand they accepted, and gain $1.25 to $1.50 (depend- 


2 The manual has been deposited with the Ameri- 
can Documentation Institute. Order Document No. 
9244 from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washington, 
D. C. 20540. Remit in advance 92.25 for microfilm 
or $5.00 for photocopies and make checks payable 
to: Chief, Photoduplication Service, Library of 
Congress. The manual may also be obtained from 
the senior author. 
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ing on whether maximum or minimum) for each 
enemy offer they accepted. 

The Ss were only given general information about 
the content of enemy demands and offers. They were 
consequently unable to distinguish between maximum 
and minimum proposals put forth by the other team. 

The demands and offers to be made by the two 
teams were written so that some of them would be 
directly contradictory, in other cases the minimum 
demand of one team would match the maximum offer 
of the opposing team, and in yet other cases there 
was no relationship between the demands and offers 
of the two teams. Without the direct knowledge of 
the Ss, demands and offers were organized so that an 
even trade of demands and offers would yield each 
person on both teams a take home pay of $8.00. 
The Ss were told that they would be able to make 
anywhere between zero and $20.00. All agreements 
had to be in writing and signed by all four nego- 
tiators. 


Measurement and Leadership Characteristics 


Ten series of verbal negotiations between sets of 
two dyad teams (involving four persons at a time) 
were carried on for three successive periods of 1 hr. 
each. The Ss did not know when the final period 
would be. None of the teams had resolved their dif- 
ferences entirely after the third period, so that an 
end effect was avoided. After each of the (three) 
negotiation periods, Ss were asked to take a recess. 
The recess period was used to collect a variety of 
data. Among the measures which Ss had to complete 
individually were the 10 scales for leadership char- 
acteristics proposed by Stogdill (1962). (See above.) 
The Ss were asked to rate their teammate, them- 
selves, and the two members of the opposing dyad 
team separately on all 10 scales. Seven-point scales 
were used, marked “much like” and “much unlike” 
at the scale ends. f 

Each of the four Ss in the negotiation sessions 
was, in addition, rated (at the end of each period) 
by a trained observer. Mean ratings (across perio! 
and Ss) for each S on each scale were correlated 
with the mean ratings (across periods) obtained from 
the trained observer. The obtained correlation © 
r=.87 was taken as an indicator of interrater re- 
liability for the leadership complex. Because of the 
high correlation, only ratings by Ss were used for 
further analysis. | 

The high correlation between ratings by Ss and 
ratings by the observer serves additionally as 4 
check on the meaning of the leadership scales. Scor- | 
ing instructions defining the meaning of the cate- | 
gories were given as in Table 1. n 

By specifically scoring for the characteristics 4 
outlined in Table 1, it was attempted to avoid much 
similarity or reciprocality in scoring categories. The 
high correlation of S ratings with observer ratings 
was taken as an indicant that Ss had followed in- 
structions sufficiently well to avoid confounding © 
the scoring categories. | 

To select the “leader” of each dyad of Ss (ne- 
gotiators), the members of each dyad were cota 


* 


LEADERSHIP AND CONCEPTUAL Srxucrunt 221 


TABLE 1 


LEADERSHIP CHARACTERISTICS AND 
Scozino Instaucrions 


Oo n 
Leadership 3 á 
ee Scoring instruction 
Tolerates un- The degree to which the subject was 
certainty willing to work on any topic at 
hand although there was doubt 
that the other side would accept 
the suggestions made. 
| 
Persuasiveness |The degree to which the subject 
attempted and succeeded in getting 
his view accepted. 
Initiation of The degree to which the subject at- 
structure tempted to impose rules or organ- 


| izational procedures upon the 
negotiations. 

The degree to which the subject per- 
mitted in himself and others be- 
havior which was extraneous to 
the task at hand. (Score here 
based on interpersonal behavior 
to avoid reciprocality to “initia- 
tion of structure.”) 

The degree to which the subject 
attempted to guide the discussion 
in task areas, no matter whether 
the discussions were held under 
structured or unstructured con- 
ditions, 

The degree to which the subject 
attempted to get signed agree- 
ments, even if these agreements 
did not touch on areas where the 
negotiation teams were far apart. 


The degree to which a subject ex- 
pressed an understanding of why 
the other team was demanding 
(or offering) particular points, 
even if these points were declared 
not acceptable by the subject. 

The degree to which a subject was 
able to foretell in advance which 
agreement would be made, and 
what terms would be agreed upon. 

The degree to which a subject would 
not tolerate interpersonal (rather 
than task-based) conflict. (Em- 
phasis here placed on interpersonal 
side to avoid reciprocality with 
“tolerates uncertainty.”) 

The degree to which the subject was 
verbally active and argued pri- 
marily for the demands and offers 


of his side. (Emphasis on task 
orientation to avoid similarity to 


“assumes leadership role.”) 
PPP 


Tolerates free- 
dom of action 


Assumes leader- 
ship role 


Production 
emphasis 


Consideration 


Predictive ac- 
curacy 


Demands recon- 


ciliation 


Represents 
gri oup 


complexity was not significant. The main ef- 
fect for leadership scales (F = 62.01) and 
the interaction effect of leadership scales with 
complexity of conceptual structure (F = 
37.74) were significant beyond the 001 level. 

In view of the highly significant interaction, 
differences between simple and complex lead- 
ers on each of the 10 leadership scales were 
tested by means of the F statistic with 1/168 
degrees of freedom. The analysis-of-variance 
within-error term was used as the estimate of 
mean square error within (cf. Winer, 1962). 
Because of repeated application, and because 
of the small error mean square (below the 
“theoretical minimum” of 1.0), a limit of $ 
< .001 was set to accept significance. Sig- 
nificant F ratios were obtained for tolerance 
of uncertainty (F = 128.0), initiation of 
structure (F = 34.03), assumption of leader- 
ship role (F = 18.0), production emphasis 
(F = 63.28), consideration (F = 52.53), pre- 
dictive accuracy (F = 69.03), and demands 
reconciliation (F = 24.75). The F ratios for 
persuasiveness, tolerance for freedom of ac- 
tion, and represents group were not signifi- 
cant. The results are shown in graphic form 
in Figure 1. 

As expected, the main effect for complexity 
was not significant. It appears that leader- 
ship characteristics are displayed by both 
simple and complex leaders. Although it is 
difficult to interpret the main effect for leader- 
ship scales since the interaction effect is 
highly significant, one might suggest that this 
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Fic. 1. Mean leadership ratings on Stogdill’s char- 
acteristics for leaders of simple and complex con- 
ceptual structure. 


result may be an effect of the negotiation 
situation. For instance, secrecy and united 
action are of vital importance in negotiations, 
and great freedom of action for the follower 
cannot be permitted by the leader if negotia- 
tions are to be successful. Consequently, one 
finds that both complex and simple leaders 
are rated near the scale midpoint on this 
characteristic. Similarly, both are rated high 
in assuming the leadership role and repre- 
senting their group, in other words, permit- 
ting little (divergent) activity by their fol- 
lower. 

The highly significant interaction effect is 
of special interest. As expected, there are some 
communalities and some divergencies in the 
characteristics of simple and complex leaders. 
Complex leaders are high on tolerance of un- 
certainty, while simple leaders are rated near 
the scale midpoint. Since complex persons 
have more facilities for integrating incon- 
gruent or unstructured information, such a 
result may be expected. Similarly, it is not 
surprising that simple leaders are more con- 
cerned with initiation of structure. Stogdill 
describes this characteristic as “clarifies own 
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role and lets follower group know what is 
expected.” This again would result in less un- 
certainty, establish clear-cut roles, and permit 
greater emphasis on accomplishing the task 
as defined. In line with this argument, sim- 
ple leaders also scored higher than complex 
leaders on production emphasis. On the other 
hand, complex leaders show more considera- 
tion and demand less reconciliation than sim- 
ple leaders. Again, this would be due to their 
greater ability to integrate alternate views, 
possibly even a preference for alternate views 
that might serve to develop alternate or more 
complex strategies. In addition, complex lead- 
ers were rated as having higher predictive ac- 
curacy. This result corroborates the finding 
of Streufert and Driver (1965) that com- 
plexity is of great advantage in the perception 
of others’ strategy. The finding that complex 
Ss are rated higher in assumption of leader- 
ship role is difficult to interpret since the 
terminology is, in this case, somewhat unclear. 
Further research on the meaning and implica- 
tions of the leadership “role” seems advisable. 

One should note that there was no differ- 
ence (r = .85, .89) in the correlation of the 
trained observer’s ratings with the mean 
group ratings of complex and simple leaders. 
In other words, the differences between com- 
plex and simple leaders on these scales are 
not likely due to different uses of the rating 
scales by complex and simple persons, but 
are reflections of actual differences in leader 
characteristics. 

To check whether the ratings obtained by 
the “leader” Ss were based on actual leader- 
ship (rather than other characteristics, e. g., 
possibly those inherent in the situation), 
mean ratings were also calculated for “fol- 
lower” Ss. Mean ratings on all categories 
were below the scale midpoint (4.0), and the 
peaks and valleys of the curves were less pro- 
nounced. On the basis of this information it 
was concluded that validity of leadership 
ratings was given. 

The results suggest that, although in some 
situations leaders who are complex and lead- 
ers who are simple will do equally well, there 
are other situations requiring specific leader- 
ship characteristics in which the simple lead- 
ers may outperform complex leaders and vice 
versa. It may be of value to study the type 
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of leadership required before a 

leader is selected. In this fashion, leadership 
and consequently group performance may be 
optimized. In negotiations, for instance, it 
may be of value to use a complex leader when 
the situation is somewhat unstructured and 
frequent modification of strategies as well as 
empathy with the opposing team are re- 
quired. On the other hand, a simple leader 
might be more advantageous if more uni- 
lateral negotiations with emphasis on un- 
modified results are required. However, a 
warning should be added: Negotiations be- 
tween teams whose members are of simple 
conceptual structure may well end in a break- 
down of the negotiations as long as both sides 
are of similar strength and if there is little 
or no meeting ground. Under such conditions, 
one would expect better results by using more 
complex leaders and negotiations teams. 
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REACTION TIME AS A FUNCTION OF THE CUE 
PROPERTIES OF AN AUDITORY DISPLAY 


J. RICHARD SIMON 


University of Iowa 


An attempt was made to increase the speed of processing verbal directional 
commands by using the ear stimulated to provide a relevant directional cue. 
In a choice reaction-time task, Ss pressed the right- or left-hand key in 
response to: (1) binaural “right” or “left” commands, (2) monaural “right” 
or “left” commands in the ear corresponding to the content of the command, 
and (3) monaural pure tones. Responses to the binaural commands were 
significantly slower than to the other 2 conditions, and responses to the 
monaural commands were significantly slower than to the monaural tones. 


A basic and continuing problem in engi- 
neering psychology is that of designing dis- 
plays which effectively communicate spatial 
relationships such as direction (Fitts, 1951). 
In order to maximize the speed of informa- 
tion processing, displays should be designed 
so that they provide cues for the direct per- 
ception of relevant directional relationships. 
This occurs, for example, when the spatial 
arrangement of a visual display is made to 
correspond with that of its associated control 
(Fitts & Seeger, 1953; Garvey & Mitnick, 
1955). 

Simon and Rudell (1967) demonstrated 
that directional cues also play a role in the 
interpretation of auditory displays. Their Ss 
performed a choice reaction-time (RT) task 
which involved pressing the right- or left- 
hand key in response to the commands “right” 
or “left” which were presented to the right 
or left ear. The Ss responded significantly 
faster when the content of the command cor- 
responded to the ear stimulated (i.e., “right” 
in right or “left” in left ear) than when it did 
not (i. e., “right” in left ear or left“ in right 
ear). In other words, a delay in responding 
occurred when the irrelevant directional cue 
(ear stimulated) conflicted with the relevant 
symbolic cue (content of the command). 

The present study represents an attempt 
to increase the speed of processing verbal 
directional commands by utilizing the direc- 
tional cue provided by the ear stimulated. 
Will Ss respond faster to a display which 
combines directional and symbolic cues, that 
is, when the “right” command is always pre- 


sented to the right ear and the left“ com- 
mand to the left ear, than they will to a 
display which utilizes either directional or 
symbolic cues alone? The expectation that 
such a result might occur is based upon evi- 
dence that two simultaneous stimuli may 
summate to elicit a faster response than any 
one of the stimuli alone (Woodworth & 
Schlosberg, 1954). Todd (1912), for example, 
has shown that RT to a combination of sound 
and shock is faster than to either one alone. 


METHOD 


Apparatus. The apparatus provided a measure of 
choice RT to a series of auditory stimuli which 
were presented to S through Grason-Stadler TDH-39 
matched earphones, Two telegraph keys were 
positioned 10 in. apart on a table in front of S. He 
was instructed to depress the correct one of the 
two keys as quickly as possible after hearing the 
signal. The instructions and stimulus materials were 
recorded on tape and presented at approximately 
80 db. by a Sony TC 500 Stereocorder. Onset of the 
signal activated a Hunter Klockounter which stopped 
when S pressed the key. Depressing the key also 
signaled E as to which key (or keys) had been 
pressed. 

Subjects. The Ss were 18 male and 18 female 
University of Iowa undergraduate volunteers between 
the ages of 18 and 29 (average age 19.8). All Ss 
passed a standard audiometric screening test in whi 
tones of 500, 1000, and 2000 cps were presented to 
each ear separately. No S had a hearing loss greater 
than 10 db. at any one of the three frequencies 
tested. 

Procedure and experimental design. Each 5 per- 
formed on three blocks of trials corresponding to 
the three experimental conditions. A 3X3 Latin 
square was used to balance the order of presentation 
of the conditions. Eight males and eight females 
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were randomly assigned to each of the three 
sequences. 

In one block of trials, the stimulus was a 1000-cps 
monaural tone. The S was told that he would hear 
the tone in either his right or left ear and that he 
was to respond by depressing the co 
(right or left) key. In a second block, S was told 
that he would hear either the word “right” or the 
word “left” in both ears simultaneously and that he 
should respond by depressing the appropriate key. 
In a third block, S was told that he would hear 
the word “right” in his right ear or the word 
“left” in his left ear and that he was to respond by 
depressing the appropriate key. Each block of trials 
consisted of a predetermined random sequence of 
42 tones or commands half of which called for a 
right response and half a left response. The first 
four responses in each block, two right and two 
left responses, were practice trials. A ready signal 
was presented 2 sec. prior to each stimulus and 
consisted of the simultaneous onset of a warning 
light and a 2000-cps, 80-db. binaural tone of 500- 
msec. duration, There was a 7-sec. interval between 
commands, 


RESULTS 


Median RTs were computed for each S 
for each treatment condition. The means of 
these median RTs were as follows: monaural 
tone (directional cue)= 307 msec.; monaural 
verbal command (directional plus symbolic 
cues) = 322 msec.; and binaural verbal com- 
mand (symbolic cue)= 420 msec. An overall 
analysis of variance revealed that the differ- 
ence between these treatment conditions was 
significant; F (2/60) = 155.10, p< .001. 
None of the other main effects (i.e., order, 
sequence, sex) or interactions was significant. 

A critical difference test (p= 05) be- 
tween individual pairs of means revealed that 
RT to the binaural verbal command was 
significantly slower than to either of the 
other two treatment conditions. The RT to 
the monaural verbal command was signifi- 
cantly slower than to the monaural tone. 


Discussion 


Results indicated that Ss were slower in 
responding to the binaural verbal command 
than to either the monaural tone or the 
monaural verbal command. The explanation 
for this finding seems obvious. With the 
binaural command, S could not begin to re- 
spond until an intelligible portion of the 


word had been articulated, whereas, with the 
other two treatment conditions, the car stimu- 
lated provided a relevant cue and S could 
begin to respond immediately upon onset of 
the stimulus. It is therefore not surprising 
that the monaural verbal command (com- 
bined directional and symbolic cues) re- 
sulted in a faster RT than the binaural verbal 
command (symbolic cue). What is surprising 
is that the monaural verbal command actually 
resulted in a significantly slower RT than 
the monaural tone (directional cue). Results 
indicated clearly that Ss were utilizing the 
directional cue contained in the monaural 
verbal command. Apparently, however, they 
were not utilizing the cue as effectively as 
they might have. It was as if the availability 
of the redundant symbolic cue (content of 
command) slowed down Ss’ reactions to the 
directional cue (ear stimulated). 

Results of the present study show that RT 
to verbal commands of “right” or “left” may 
be reduced by presenting these commands to 
the appropriate ear, that is, adding a relevant 
directional cue. However, results also indicate 
that this method of conveying directional in- 
formation is not superior and, in fact, is in- 
ferior to presenting pure tones to one ear or 
the other. These results must not be used as 
a basis for generalizations concerning (a) 
the relative effectiveness of directional versus 
symbolic cues in auditory RT, or (5) the 
general effect of combining directional and 
symbolic cues. Such generalizations are un- 
warranted because of the qualitative differ- 
ence between the two kinds of stimuli (tone 
and verbal command) used in this study. 
Future research should hold this variable 
constant, for example, train Ss to associate 
different directional responses to tones of 
differing frequency and then compare RT to 
binaural and monaural presentations of these 
tones. 
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COMPARISON OF JUDGMENTAL METHODS WITH 
ENDORSEMENTS IN THE ASSESSMENT OF 
PERSONALITY TRAITS * 


IGOR KUSYSZYN 
York University, Downsview, Ontario 


The study focuses on the degree to which judgment methods are free from 
dissimulation effects as compared with endorsements. 8 structured personality 
scales from the Personality Research Form were administered under 4 in- 
structional sets—Endorsement, Desirable in Others, Frequency of Endorsement, 
and Desirable in Seli—to 94 members of 7 local college fraternities. The 
validity of both judgments and endorsements was examined by relating them 
to noninventory behavioral criteria in the form of behavior ratings. Although 
judgments were found to be relatively free from the kinds of dissimulation 
effects found with endorsements, judgments yielded significantly lower validities 


than endorsements. 


It has long been known that personality 
tests of the paper-and-pencil variety are sub- 
ject to faking. Although some tests have 
employed validity scales to detect distortion, 
an invalid profile is of little use to the in- 
vestigator who is interested in the individual’s 
personality traits. One approach to the prob- 
lem of faking, implemented in the present 
study, is to change the basic task of the 8 
from that of describing himself to that of 
making judgments about the test items. For 
example, instead of asking S to evaluate an 
item as “true” or „false“ of himself, one 
might ask him to judge how desirable he 
thinks others feel it is to respond “true” to 
the item. Thus, through the use of judg- 
ments, the purpose of the task may be 
disguised so as to minimize the operation of 
dissimulation tendencies. At the same time 
judgments may yield enough systematic re- 
sponse variance to provide an indication of 


1 This study is based on a thesis submitted by the 
author in 1967 to the Department of Psychology, 
University of Western Ontario, in partial fulfillment 
of the requirements for the degree of Doctor of 
Philosophy. The study has profited greatly from the 
author’s many discussions with Douglas N. Jackson, 
as well as from his careful reading of the manu- 
script, The able assistance of J. Edwin Boyd in 
programming the data for computer analysis is also 
gratefully acknowledged. This research was sup- 
ported in part by the University of Western Ontario 
Research Fund and in part by Grant No. 151 from 
the Ontario Mental Health Foundation to Douglas 
N. Jackson. 


the individual’s own personality (cf. Camp- 
bell, 1950). 

There have been very few studies published 
in which an attempt was made to use judg- 
ments for the purpose of assessing personality 
traits. In an early study, Rosen (1956) ad- 
ministered the Minnesota Multiphasic Per- 
sonality Inventory under two judgment sets, 
“personal desirability” and “perceived social 
desirability,” and obtained positive but low 
correlations between these sets and the stand- 
ard true-false set. Rosen concluded that the 
correlations were sufficiently low as to sug- 
gest the relative independence of each set. 
More recently, Messick (1960) factor ana- 
lyzed social desirability ratings of the Ed- 
wards Personal Preference Schedule (Edwards, 
1954) and obtained nine dimensions of social 
desirability. Jackson (1961, 1964a) suggested 
that the presence of these different “points of 
view” about desirability raises the possibility 
that the judgments of items might be useful 
in drawing inferences about the personality of 
the judge. Jackson (1961, 1964a) tested this 
hypothesis by using desirability judgments of 
a conformity scale to predict an attitude-shift 
criterion of conformity. The judgments were 
found to have a slightly higher relationship 
with the criterion (.29) than the endorsements 
(22). In a replication of the Jackson study 
by Goldberg and Rorer (1966), in which Ss 
predicted the responses of their group mem- 
bers instead of judging the desirability of the 
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items, a correlation of .30 between predictions 
and the criterion was obtained while the 
validity of the endorsements was only .06. 
However, in an earlier replication of the Jack- 
son study by Loomis and Spilka (1963), the 
validity of endorsements (.26) was superior 
to the validity of the judgments (.15). In a 
broader study of conformity, Stricker, Mes- 
sick, and Jackson (1966) related judgments 
and endorsements of an Independence-Vield- 
ing scale to four kinds of conformity criteria 
and found judgments to be better predictors 
than endorsements in the case of Ss who were 
suspicious as to the nature of the study. 
The positive results in favor of judgments 
in three of the last four studies cited above 
suggest the potential importance of judgment 
instructions in trait assessment, and the 
limited scope of these studies (i.e., only one 
kind of criterion—conformity—was predicted) 
clearly indicates the need for a broader and 
more thorough investigation of judgment 
methods. The present study will attempt to 
partially fulfill this need by using criteria 
which differ in method from questionnaires. 
These are in the form of behavior ratings by 
peers. The validities of eight personality scales 
administered under three kinds of judgment 
sets and under an endorsement set will be 
evaluated. The susceptibility of judgments 
and endorsements to dissimulation effects will 
be studied by comparing the validities of de- 
fensive and social desirability respondents 
with the validities obtained by low defensive 
and low social desirability respondents, re- 
spectively. 


METHOD 
Subjecis 


The Ss were 94 males ranging from freshmen to 
graduate students from seven college fraternities in 
the city of London, Ontario. Thirty-six lived inside 
their fraternity houses (live-ins) and 58 lived out- 
side (live-outs). They had been acquainted with one 
another for a mean period of 18 mo. and had a 
mean age of 21.9 yr., with a standard deviation of 
1.7 yr. 


Personality Inventory 


The inventory consisted of eight 20-item person- 
ality scales from the Personality Research Form 
(PRF) (Jackson, 1964b) and two 20-item validity 
scales, Social Desirability (Jackson, 1964b) and De- 
fensiveness (Jackson & Messick, 1964). The per- 
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sonality scales are as follows: Nurturance, Exhibi- 
tion, Autonomy, Play, Aggression, Achievement, Im- 
pulsivity, and Order. The scales have a median test- 
retest reliability of .82 and a median validity, based 
on behavior ratings, of 54. The social desirability 
factor was suppressed in all scales during item anal- 
ysis. Acquiescence is suppressed by balanced true- 
false item keying. The Social Desirability scale mea- 
sures the degree to which an individual responds in 
a socially desirable manner; the Defensiveness scale 
measures the degree to which a bias in self-report is 
operating. 


Trait Adjectives 


For the behavior rating task, eight matched pairs 
of trait adjectives were selected so as to represent 
the eight personality scales of the inventory (matched 
pairs instead of single adjectives were selected so 
that the reliability of the ratings could be deter- 
mined). A brief definition of the trait accompanied 
each adjective. The adjectives and the scales they 
represent are as follows: supporting, helpful (Nur- 
turance); attention seeking, entertaining (Exhibi- 
tion); independent, nonconforming (Autonomy); 
playful, fun loving (Play); aggressive, argumentative 
(Aggression); achieving, ambitious (Achievement); 
impulsive, impatient (Impulsivity) ; orderly, meth- 
odical (Order). 

The rating instructions asked Ss to rate how char- 
acteristic each trait was of each of the members of 
their fraternity group. The ratings were made on 
a 9-point scale ranging from “extremely character- 
istic” through “neutral” to “extremely uncharacter- 
istic” of the individual. 


Instructional Sets 


The personality inventory was administered under 
four instructional sets—one endorsement set and three 
judgment sets. The sets, briefly defined, are as fol- 
lows: Endorsement— Rate how characteristic each 
statement is of you”; Desirable in Others—“Rate 
how desirable each statement is assuming it is true 
of others“; Frequency of Endorsement— Rate how 
frequently people in general would answer ‘true’ ito 
the statement”; Desirable in Seli—“Rate how desir- 
able the statement is when it is applied to you.” 
All ratings were made on a 9-point scale similar to 
that used with the trait adjectives. 


Administrative Procedure 


Each fraternity group was run separately in 4 
classroom setting. The groups ranged in size from 9 
to 19 Ss. The Ss performed the adjective trait rating 
task first, then the personality inventory was ad- 
ministered successively under each of the four in- 
structional sets in the order in which they are de- 
scribed above. 


RESULTS 
Defensiveness and Social Desirability Groups 


To investigate the effects of biased re 
sponding, the total sample was split at the 
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median of the Ss’ Defensiveness scale scores 
obtained under the Endorsement set. This 
yielded a mean Defensiveness scale score of 
99 (the possible range of scores being 20- 
180) for the High Defensive group (N = 47) 
and a score of 76 for the Low Defensive 
group (N = 47). To investigate the effects of 
socially desirable responding, the total sample 
was similarly split at the median of the Social 
Desirability scale scores. This split yielded a 
mean Social Desirability scale score of 140 for 
the High Social Desirability group (N = 47) 
and a mean score of 118 for the Low Social 
Desirability group (N = 47). 


Reliability of Behavior Ratings 


The median matched-pair reliability of the 
eight pairs of trait adjectives was .88 for the 
total sample (V= 94), .83 for the live-ins 
(N = 36), .79 for the live-outs (N = 58), 
88 and .86 for the Low Defensive (N = 47) 
and High Defensive (V= 47) groups, re- 
spectively, and . 88 and .88 for the Low Social 
Desirability (M = 47) and High Social De- 
sirability (WV = 47) groups, respectively (co- 
efficients are Spearman-Brown corrected). 

The median between-rater reliability (agree- 
ment between subgroups of raters rather than 
the correspondence between matched pairs of 
adjectives) for the 16 single adjectives was 
.83 (Spearman-Brown corrected) for the total 
sample (WV = 94). 


Reliability of Inventory Scales 


The reliabilities of the judgment and en- 
dorsement sets did not differ significantly. 
For all seven groups and for all four instruc- 
tional sets, the median reliabilities of the 10 
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inventory scales ranged between .67 and 86 
(split-half and Spearman-Brown corrected). 


Comparison of Judgment and Endorsement 
Validity Coefficients 

The criterion behavior rating scores were 
calculated as follows. Each S was rated by 
his peers on each of the 16 trait adjectives. 
The mean rated score for each & for each of 
the 16 adjectives was then determined. Since 
the 16 adjectives represented eight matched 
pairs of adjectives, the means of each of the 
eight matched pairs were added, yielding 
eight criterion scores per S, each of the eight 
scores corresponding to one of the eight per- 
sonality scales of the inventory. Validity co- 
efficients were determined for each of the 
eight personality scales under each of the four 
instructional sets by correlating the Ss’ in- 
ventory scale scores with their behavior rat- 
ing scores. For purposes of taking into ac- 
count the varying reliabilities of the trait rat- 
ings and of the inventory scales, all validity 
coefficients were initially corrected for at- 
tenuation. However, since the corrected coef 
ficients revealed the same patterns of relation- 
ships among validities as did the raw coef- 
ficients, only the raw validity coefficients will 
be presented. 

Table 1 presents the validity coefficients for 
the total sample (V= 94). The Endorsement 
set yields the highest validities (Mdn r= 
40), the Desirable in Others set, the next 
highest (Mdn r = .20), the Desirable in Self 
set, the next highest (Mdn r = .17), and the 
Frequency set, the lowest (Mdn r = .06). 

For purposes of conserving space, only the 
median validity coefficients will be presented 


TABLE 1 
Vaumrry COEFFICIENTS FOR THE TOTAL SAMPLE 


Nurt Exh Auto Play Agg Ach Imp Order Mdn 
Endorsements 29 56 24 58 35 at 4 5 2 
Desirable in Others 19. 26 10 23 15 775 io a 
Frequency 19 —07 —06 02 4 5 9 11 s 
Desirable in Self 19 31 05 37 


Note. V = 94, Decimals omitte 


d. 
The os and ol significance levels of rare Tt e determined by the Wilcoxon signed-rank test : Endorse- 


Significant differences between the Endorsement 


Abbreviations: Nurt = Nurturance; 
Achievement; Imp = Impulsivity; Order = Order. 


; < .01; Endorsements—Desirable in 
ments—Desirable in Others, 2. < 02; Endorsement 19 Aran 


+P <.01. 
Play = Play; Agg = ‘Kegression; Ach = 
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TABLE 2 


MEDIAN Vatiprry COEFFICIENTS oF EIGHT PERSONALITY SCALES 
UNDER JUDGMENT AND ENDORSEMENT SETS 


- map i High Low High Low 
* b 
o N v desirability* | desirability 
— | - — 
Endorsement 47 34 37 | 32 36 48 
Desirable in Others 11 17 13 24 % 18 26 
Frequency 14 —03 —01 | 18 07 13 
07 | 28 | 07 18 
U 


Desirable in Self 17 10 


Nate. Decimals omitted. 
‘`N = 36. 


ON m58, 
‘N =47. 


for the other six groups which were studied.* 
These correlations are presented in Table 2. 
From Table 2 it is seen that in each group 
the Endorsement set yields higher validities 
than any one of the three judgment sets. The 
Wilcoxon matched-pairs signed-rank test re- 
vealed that each of the three judgment sets 
differed significantly from the Endorsement 
set in all six groups. 

When the endorsement validities of each 
pair of subgroups were compared (Mann- 
Whitney U test) live-ins obtained signifi- 
cantly higher validities than live-outs (p < 
05); the validities of the Defensiveness sub- 
groups and the validities of the Social Desir- 
ability subgroups did not differ significantly, 
When the validities of each pair of subgroups 
were compared under each of the three judg- 
ment sets (Mann-Whitney U test) no signifi- 
cant differences were found between the 
live-ins and the live-outs and no significant 
differences were found between the Social 
Desirability subgroups. However, the Low 
Defensive group obtained significantly higher 
validities than the High Defensive group 
under the Desirable in Others (p < .01) and 
under the Frequency (p < 01) sets. 

When the individual validity coefficients of 


The entire 48 X 48 correlation matrix including 
all validity coefficients and all off-diagonal cor- 
relations has been deposited with the American 
Documentation Institute. Order Document No, 9828 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
ton, D. C. 20540. Remit in advance $1.25 for 
photocopies or $1.25 for microfilm and make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 


Table 1 are examined they are seen to vary 
greatly in magnitude, especially under the 
judgment conditions. This variability indi- 
cates that certain Judgment Set & Scale Con- 
tent interactive effects must be operating. 
That is, the same item content is rated dif- 
ferently under different judgment sets. When 
the individual validity coefficients of the De- 
fensiveness subgroups were examined, certain 
Scale X Subgroup interactive effects were also 
seen to be operating, especially under the 
judgment sets. For example, for the traits Ex- 
hibition, Aggression, and Achievement, Low 
Defensive Ss obtained positive validity coef- 
ficients while High Defensive Ss obtained 
negative validity coefficients, under the same 
judgment sets. Similar interactive effects were 
found for the Social Desirability subgroups. 


Behavior Rating and Personality Scale Score 
Means 


Table 3 presents the mean behavior rating 
scores and the mean personality scale scores 
of the High and Low Defensive groups- 
These means are compared in the hope that 
they will shed some light on the pattern of 
validity coefficients obtained by these two 
groups. (It will be remembered that the Low 
Defensive group obtained significantly higher 
validity coefficients than the High Defensive 
group on two of the judgment sets se“ 
above.) When the mean scores are examined 
on the basis of a general “favorable-unfavor- 
able” trait dimension, it is seen that High De- 
fensive Ss obtain higher scores than Low De- 
fensive Ss on favorable traits and lower scores 
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TABLE 3 
Mean Trarr Ranwo axp Meur Peasowatrry Soue Scomes ros Piou aw 
4 Low Derexuve Geours 
= > —— — 
Nurt Exh Auto Play Age Ach Teg Onder 
High defensive group (V = 47) 
Trait ratings 12 9 9 10.16 110 gos Aer“ OM nna 
Endorsements 119˙ 9 95 104" as” le ow le 
Desirability in 
Others 130 100˙ 95 9 7 ni al n 
Frequency 14 %9 98 105 a 108 oa 10 
Desirability in Self | 132 100˙ 97 ow 6s 137 76 ist 
Low defensive group (N = 47) 
Trait ratings 11,96 1097 10.64 12.640" sas 12.65" 10.41 tss 
Endorsements 113* 114" 94 118˙ * 100 107 ja; 
Desirability in 
MOthers 130 107 94 102 7¹ iM 82 128 
Frequency 116 99 95 105 86 ni 9i tió 
Desirability in Self | 129 110°* 99 100* 6 137 » in 


Note.—The possible 
See Note of Table 1 for al 


score range for a trait rating score is 2-18 and the range for a personality scale score bs 20-10, 


The entries marked with asterisks indicate statistically significant differences between the High asd Low Defensive groupe 


between the scales obtained under parallel instructional sets, 
> <.05. 
> <.01, 


on unfavorable traits. For example, when the 
behavior rating scores of the two groups are 
compared it is seen that the High group ob- 
tains significantly lower scores than the Low 
group on Exhibition and Play (Play was de- 
fined as “horsing around,” always having a 
good time, etc.) and higher scores than the 
Low group on Achievement and Order. Under 
the Endorsement condition the High group 
again obtains lower scores on Exhibition and 
Play and higher scores on Achievement and 
Order, as well as obtaining lower scores on 
Aggression and Impulsivity and a higher score 
on Nurturance. However, High Defensive Ss 
do not respond much more favorably than 
Low Defensive Ss under the judgment sets: 
there were no significant differences between 
the two groups on the Frequency set, only 
one significant difference under the Desirable 
in Others set (Exhibition), and only two 
significant differences under the Desirable in 
Self set (Exhibition and Play) (see Table 3). 

Similar score differences were obtained be- 
tween the means of the Social Desirability 
subgroups but fewer of the differences were 
found to be statistically significant. 


Discussion 


It will be recalled that the major purpose 
of the study was to examine the validities of 
judgments and endorsements, especially in 
relation to the operation of dissimulation 
effects. The obtained results were in one 
sense supporting of the use of judgment sets 
for controlling response biases, but it became 
clear that much more work is needed to 
establish an unequivocal foundation for the 
use of judgments routinely in personality 
assessment. Thus, the judgment sets were 
found to be relatively free from dissimula- 
tion effects in that little evidence of biased 
judgments was found under any of the three 
judgment sets. However, the validities of the 
judgment sets were low in relation to the 
validities obtained with endorsements. En- 
dorsements yielded higher validities for all 
groups, even in the case of High Defensive 
and High Social Desirability respondents who 
endorsed items in a favorable direction. 

When the mean personality scale scores of 
the High and Low Defensive groups were 
compared under the Endorsement condition, 
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High Defensive Ss obtained more favorable 
scores than Low Defensive Ss. However, it 
was also observed that High Defensive Ss ob- 
tained more favorable behavior ratings than 
Low Defensive Ss. That is, not only do 
High Defensive individuals respond in a 
favorable direction to personality items, but 
they also seem to create, quite strongly, 
favorable impressions upon others in real life. 
This finding is important for several reasons. 
Although other factors may be operating, the 
fact that there is a correspondence between 
the favorable endorsement of personality 
items and favorable responding in real life 
(as reflected by the behavior ratings) may be 
why the endorsements in the present study 
show higher validities than the judgments 
for the Defensiveness subgroups. The fact 
that the behavior ratings of the Low Defen- 
sive group covaried with the judgments to 
a greater degree than did the behavior ratings 
of the High Defensive group would also pro- 
vide an explanation for the judgment validities 
being higher for the Low than for the High 
Defensive Ss. This follows from the fact that 
there were few significant differences between 
the judgment scores of the two groups but 
there were relatively more significant differ- 
ences between their behavior rating scores. 

The finding that High Defensive individ- 
uals present favorable impressions in real life 
poses an interesting theoretical question. Do 
these individuals create favorable impressions 
intentionally or are they unaware of creating 
them? If they are unaware of the way in 
which they present themselves, that is, if 
favorable responding is part of their “life 
style” (cf. Crowne & Marlowe, 1964), then 
it is clear that personality scales are reflect- 
ing, in part, this “life style” of positive pre- 
sentation. On the other hand, if they are 
aware of playing this favorable role, should 
one accept their scale scores or should one 
be skeptical and begin a search for their 
“real” or “true” personalities? If one commits 
himself to the definition of behavior of pri- 
mary research interest as that which is en- 
dorsed on structured tests, as the present 
researcher has done, then one must accept 
by this definition all reliable test responses 
as being indicators of “true” personality in 
the sense that each response is determined 
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by certain personality characteristics. In this 
sense one would accept the endorsement 
validity correlations of the High Defensive 
Ss as they stand. One would also be obliged 
to account for all reliable variance obtained 
under the judgment sets, including the rela- 
tively modest portion which proved to be 
related to criterion trait ratings. In this latter 
case, it is believed that the correlations be- 
tween the judgments and the trait ratings 
were small in relation to the endorsement 
validities, because the judgment tasks differ 
in nature from the endorsements. The judg- 
ment tasks, being indirect, did not require Ss 
to reveal their personality traits in the same 
sense as did the endorsements. The judg- 
ments asked for an opinion of how desirable 
an item is in oneself or in others, or how 
frequently others would endorse the item. 
One implication of the present study, there- 
fore, is that endorsement rather than judg- 
ment instructions be used if one desires a 
valid measure of personality by means of 
structured scales and trait-rating criteria. The 
relatively high criterion validities, especially 
in the case of High Defensive and High Social 
Desirability respondents, justify the use of 
endorsements. The empirical support for the 
uniqueness of judgments suggests that more 
work needs to be done in uncovering the 
properties of judgments before applying them 
to the assessment of personality traits. The 
unique properties of the individual judgment 
sets were revealed in the present study by the 
interactive effects obtained between judg- 
ments and traits. For example, under one 
judgment set the validity coefficient is high 
while under another judgment set the validity 
coefficient for the same trait is low. Similar 
interactive effects between judgments and 
scale content are reported by Jackson and 
Singer (1967). In their study Ss were found 
to attribute the more pathological traits to 
others to a greater degree than to themselves. 
Further support for the complexity of judg- 
ments comes from a study by Jackson 
(1964b). Jackson's results suggested that in 
the case of certain traits a complementary- 
needs hypothesis may be warranted. For e 
ample, Ss scoring high on the trait Dominance 
under the endorsement condition were found 
to judge Submissiveness as being desirable in 


specific item content. In this way the com- 
plementarity-of-needs hypothesis as well as a 
similarit- f- needs hypothesis could be more 
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EFFECTS OF THE TYPE OF STIMULUS EMPLOYED 
AND THE LEVEL OF SUBJECT AWARENESS ON 
THE DETECTION OF DECEPTION * 


RICHARD I. THACKRAY 2 anp MARTIN T. ORNE € 


Institute of the Pennsylvania Hospital and University of Pennsylvania 


An experiment was designed to compare detection of personal material with 
that of neutral material made relevant only in the experimental context. Each 
S acted the role of an espionage agent and attempted to conceal both his 
personal identity and certain code words he had learned. Personally relevant 
material was found to be significantly more detectable than the neutral ma- 
terial, although consequences of detection were the same for both types. In 
addition, the study attempted to provide exploratory data concerning physi- 
ological responsivity to lie-detection stimuli when Ss were unaware that 
responses were being monitored. While there was no evidence that detection 
was inferior under the nonaware condition, difficulties in achieving a com- 
pletely convincing nonaware situation suggest caution in generalizing from 


these findings. 


Previous studies in detection of deception 
have reported varying rates of detection 
which appear to be related to the type or 
nature of the stimulus situation employed. 
Lykken (1960), for example, obtained 100% 
accuracy in matching idiosyncratic material 
(mother’s name, father’s name, etc.) to a 
particular S. Somewhat lower detection rates, 
ranging from 94% (Lykken, 1959; Marston, 
1921) to 50% (Landis & Wiley, 1926), are 
reported when mock crimes are employed. 
Still lower rates, ranging from 83% (Ruck- 
mick, 1938) to 40% (Landis & Wiley, 1926), 
are found with such “neutral” stimuli as 
numbers, letters, etc. While these studies 
would suggest that the greater the degree of 
affect associated with the type of stimuli 
employed the greater the obtained detection 
rate, differences between studies in methods 


1This work was supported in part by the United 
States Army Research and Development Command, 
Department of the Army, under Research Contract 
No. DA-49-193-MD-2647. 

2 Now at the Civil Aeromedical Research Institute, 
Federal Aviation Administration, Oklahoma City, 
Oklahoma. 

8 The authors wish to express their appreciation 
to Santina Clauser and Luise Miller for their assist- 
ance in this experiment and in particular to William 
Orchard, who was Ei, and to Mary-Jo Bryan and 
George Smiltens, who served as technicians, They 
also wish to thank Frederick J. Evans, Ulric 
Neisser, Donald N. O’Connell, Emily C. Orne, and 
Peter W. Sheehan for their helpful comments 
during the preparation of the manuscript. 


of stimulus presentation, analysis of data, 
particular physiological measures used, etc., 
make valid comparisons virtually impossible. 
Since systematic comparison of different 
types of stimulus material within a single 
study has not previously been reported, a pri- 
mary aspect of the present study was to ex- 
plore the importance of the stimulus dimen- 
sion as a determinant of detection rate. 
Specifically, detection rates derived from 4 
mock crime type of paradigm similar to one 
previously used (Thackray & Orne, 1967) 
were compared with rates obtained from 
personal or idiosyncratic material. ; 
The study was also designed to provide 
some exploratory data about &'s physiolog- 
ical responsivity to lie-detection stimuli when 
unaware that his responses are being moni- 
tored. A number of studies have suggested 
that detection of deception is difficult or im- 
possible if S is not motivated to deceive of 
if there are no consequences to being detected 
(Chappell, 1929; Gustafson & Orne, 1963). 
Thus, by eliminating S’s awareness of being 
monitored, he might not at that time be at 
tempting to deceive, would therefore have 
nothing to hide, and would be difficult to 
detect. Under nonaware conditions, a differ- 
ence between personally relevant words and 
neutral words which achieve relevance only in 
the context of the experiment might be par- 
ticularly striking. For example, hearing one s 
own name is likely to result in a greater 
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physiological response than hearing other 
names independent of the detection-of-d 

tion situation. On the other hand, when decep- 
tion is not being attempted, such differentially 
greater physiological responsivity might not 
occur with material that is relevant only in 
the context of the experiment. 

The physiological response measure em- 
ployed in the present study was the galvanic 
skin response (GSR), since previous labora- 
tory studies have shown it to be the single 
best index of deception (Ellson, Davis, Saltz- 
man, & Burke, 1952; Kubis, 1962; Kugelmas 
& Lieblich, 1966; Violante & Ross, 1964). To 
use this measure under a nonaware condition, 
however, presents certain difficulties. There 
appear to be no completely adequate tech- 
niques for insuring uniform and continuous 
electrode contact without having electrodes 
physically attached to S. It was felt, though, 
that a situation might be devised in which 
electrodes were attached, but in which & 
would believe that no recordings were actually 
being taken. The situation chosen approxi- 
mated closely the procedure followed in actual 
field polygraph examinations. In the field 
procedure, each question is carefully reviewed 
with the suspect prior to the actual examina- 
tion. In the present experiment, GSRs were 
surreptitiously recorded during this review 
phase in the hopes of approximating a non- 
aware condition. 


METHOD 
Subjects 


Twenty-eight male undergraduate college students 
were recruited from local universities. All were 
volunteers and none had previously participated in 
detection-of-deception experiments. 


Procedure 


The critical stimulus material consisted of three 
common words and three personally relevant items 
of information: S’s first name, last name, and date 
of birth. Each critical stimulus was embedded 
among five similar stimuli. Prior to S's arrival, an 
assistant randomly selected one card from each of 
three sets of six cards containing names of com- 
mon birds, trees, and colors, respectively, which 
would subsequently serve as “code words” to be 
learned by a given 5.4 He also selected five last 
— — 


The actual stimuli employed were as follows: 
Colors—PURPLE, ORANGE, BROWN, GRAY, GREEN, BLUE; 
Birds—crow, PIGEON, OWL, WREN, ROBIN, SPARROW; 
Trees—oaK, HEMLOCK, ELM, SPRUCE, MAPLE, PINE. 
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names from a large list of names of similar ethnic 
origin, five first names from a list of common make 
names, and five random dates (month and day). 
The S's own last name, first name, and date of 
birth were then randomly positioned in their re- 
spective sets. The only restriction placed on the 
sets was that the first item served as a dummy in 
the interrogation and consequently could never be 
a significant stimulus. 

Each S was seen by E, who explained that the 
effectiveness of the polygraph as a lie detector was 
being studied. He was told that he was to play the 
role of an espionage agent who knows certain 
critical code words and that he would be given 
a polygraph examination subsequently by the inter- 
rogator (E.). It was further explained that, due to 
a security leak, the interrogator was in possession of 
a pool of code words as well as the names and 
birthdates of six agents. His own name and the code 
words to be learned would be contained in this 
list. To maintain his anonymity, however, he would 
be introduced as “S 27” and the interrogator would 
not know whether he (the S) was one of the in- 
dividuals whose names he had or whether he was in 
fact innocent (belonged to a control group who did 
not receive such information). The S's task was to 
convince the interrogator that he was innocent and 
that none of the code words or personal informa- 
tion had any meaning to him. It was explained to 
him that while it was difficult to deceive the 
polygraph, some highly intelligent and mature in- 
dividuals had been able to do so. The S was then 
given a booklet which was constructed to maximize 
overlearning of the three code words. 

After S had completed the initial phase of the 
experiment, he was introduced to E, as S 27. The 
Ei simultaneously gave Es the set of stimulus cards 
specific for the S. While no innocent Ss were used 
in this study, Zs was actually blind as to the relevant 
code words and personal information concerning 
the S. As S entered the room, the polygraph was in 
the process of being “calibrated.” The technician 
completed her task and turned off the instruments 
as S was being seated in a stratolounger chair facing 
away from the equipment.“ The Es treated S 
analogously to the way an interrogator might. He 
explained the basic operation of the polygraph and 
stressed the virtual infallibility of the instrument. 
He explained to S that if he were guilty the equip- 
ment would certainly register this fact, and he 
could save considerable time and trouble by con- 
fessing (no one did). The interrogator explained 
that he would go over each of the questions with 
S to make certain there was no ambiguity, surprise, 
or confusion regarding any of the questions or the 
procedure itself. The S was told that after the 
familiarization phase, the polygraph equipment 


5 The “calibration” was to acquaint S with the 
sound of the polygraph and to reinforce his belief 
that no recording was being attempted prior to 
the actual examination. 
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would be turned on, adjusted, and the examination 
itself carried out. 

Electrodes and transducers for recording GSR 
and six other variables were then attached. (The 
remaining variables will be discussed in a separate 
paper.) The leads from the various electrodes, 
transducers, and associated equipment were not 
connected to the polygraph and were left loosely 
arranged on S’s lap. One of these polygraph leads 
was actually a dummy, however, and led from one 
of several small transducer boxes located on S's 
chair. The GSR electrodes led to this box which con- 
tained the transmitter and bridge circuit of an FM 
telemetry system.6 The GSR was telemetered to a 
separate room and recorded on a separate polygraph 
elsewhere. It was explained to S that the electrodes 
had to be attached at this time in order to allow 
sufficient time for them to “stabilize” prior to the 
later polygraph examination. 

After all electrodes were attached to the S, Es 
informed him that the questions would now be 
reviewed with him. Since the purpose of the pre- 
interrogation was to familiarize him completely with 
all aspects of the polygraph examination which 
would subsequently follow, S was told not only to 
listen to the questions, but to answer each with a 
“yes” or “no” just as he would later during the 
examination, The S was then questioned on each 
word in the six stimulus sets. The items were pre- 
sented by Es as informally as possible, allowing 
approximately 6 sec. between each item to provide 
sufficient time for the telemetered GSRs to be 
recorded. Half the Ss received the code-word ma- 
terial first followed by the personal material, while 
the order was reversed for the remaining half. 

Upon completion of the preinterrogation phase, 
the various leads were connected to the polygraph and 
the equipment was adjusted by an assistant acting 
as the polygraph operator. The S was then informed 
that the examination would begin. Questions were 
asked in the same order employed in the pre- 
interrogation, and S was again asked to answer each 
with a “yes” or “no.” A 16-sec. interstimulus interval 
was used to allow for differing latencies in the 
physiological response measures employed. Follow- 
ing completion of the questions, S was disconnected 
from the polygraph and taken to Ei for a post- 
experimental inquiry. 

The data consisted of the telemetered GSRs ob- 
tained during the preinterrogation as well as dur- 
ing the “actual” polygraph examination. Since the 
telemetered data did not yield absolute values of 
resistance, the magnitude of response was obtained 
by measuring in millimeters the maximum pen 
deflection to occur within 5 sec. following the onset 
of each verbal stimulus. The response to the critical 
stimulus was ranked against the four control words 
in its respective set, and a rank of 1 was assigned 
to the largest response. Mean ranks were then de- 
termined for each § on both the code words and 
the personal items. 


6 Onyx Designs, Inc. 
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RESULTS 


Table 1 displays the mean ranks obtained 
for both types of stimulus material under 
the nonaware and aware conditions. A mean 
rank of 1 would indicate perfect detection; 
it is noteworthy that in every comparison 
personal material yields a smaller mean rank 
than code-word material. To test the pos- 
sibility of combining orders, comparisons were 
made between ranks of personal words under 
both orders of presentation, and likewise 
between code words under both orders. Sepa- 
rate comparisons were made for the nonaware 
and aware series. None of the differences 
was significant at the .05 level (Mann- 
Whitney U; two-tailed). The orders were 
then combined and the results shown in 
Table 2. As indicated in this table, Wilcoxon 
matched-pairs signed-rank tests revealed a 
significant superiority of personal words over 
code words. 

In order to determine whether detection 
rate differed as a function of awareness, chi- 
square tests were separately performed on the 
personal- and code-word distributions. Since 
the expected chance value of the mean ranks 
is 3, the number of mean ranks above and 


TABLE 1 


Mean GSR Ranxs ON Cope Worps (CW) AND 
PERSONAL Worps (PW) FoR BOTH ORDERS 
OF PRESENTATION UNDER BOTH 
NoNAWARE AND AWARE 


CONDITIONS 
Condition Order Stimulus X rank 
material 
PW 1.38 
PW CWa. 
CW 1.67 
Nonaware 
cw 1,63 
CWPW 
PW 1.23 
PW 1.46 
PW-CW 
CW 1.70 
Aware 
CW 1.55 
CW-PW 
PW 1.42 
aN = 13, 
bN =15, 
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TABLE 2 


Mean GSR Ranks ox Counen Cong Worns 
(CW) axb Persoxar Worns (PW) vr 
NONAWARE AND AWARE CONDITIONS 


son dtiinn | Stimulus 
Condition 1 K rank x T 
| PW 1.30 26.03 
Nonaware 26.5** 
CW 1.65 222 
PW 1.44 21.33˙ 
Aware 38.87 
CW 1.62 26.0355 
Note. — N = 28. 
* p <.025, one-tailed test. 
** > <.01, one-tailed test. 
* < 001, 


below this value was determined for each 
distribution and evaluated against the ex- 
pected frequencies. As Table 2 indicates, the 
four chi-square values were significant, in- 
dicating better than chance detection under 
both awareness conditions. To test for any 
relative superiority of the two conditions, a 
comparison was made between ranks of per- 
sonal words in the nonaware and aware series. 
Likewise, a similar comparison was made be- 
tween ranks of code words in both series. 
Wilcoxon tests yielded nonsignificant z values 
of 1.09 and .09, respectively. 

During the postexperimental inquiry each 
S rated how detectable he thought he was 
on the three personal-word and three code- 
word items. Mean ranks for these two types 
of stimulus material were determined for each 
§, and the difference between the distributions 
tested with a sign test. Results indicated Ss 
felt themselves to be more detectable on the 
personal-word material ( < .001). 


Discussion 


Previous research has focused on the con- 
sequences of detection and S’s motivation 
to deceive as significant variables affecting 
the likelihood of differentially increased physi- 
ological responses (Gustafson & Orne, 1963, 
1965). In the present study, responding to 
either personal items or code words would 
lead to detection. The S was aware that a 
response to either type of stimulus would have 
the same consequences; that is, he would 
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reveal himself to be a guilty person. Yet, the 
results indicate that detection of personally 
relevant material is superior to relatively 
neutral stimuli which are made relevant only 
in the context of the experiment. If S's mo- 
tivation to deceive and the consequences of 
deception are held constant, however, it seems 
reasonable that personal material would be 
more likely than code material to evoke some 
form of emotional response. These findings 
are not inconsistent with previously reported 
results that situational factors have marked 
effect on detection rates, since significant 
detection was achieved with neutral stimuli 
made relevant in the experimental context. 
Thus, one would expect a decrease in detec- 
tion rates with both kinds of stimuli in a situ- 
ation where motivation is markedly reduced. 
The findings do suggest, though, that situa- 
tional determinants alone are not sufficient to 
account for the obtained detection rates. If 
detection were only a function of its con- 
sequences, personal material would not have 
been detected better than code-word material. 

The differential response between personal- 
word and code-word material observed in the 
present study is consistent with the findings 
of Oswald (1962, pp. 47-50) that Ss during 
sleep are more likely to rouse to their own 
name than to other stimuli presented at equal 
volume even if motivating contingencies are 
associated with other stimuli, though mo- 
tivating contingencies do affect the prob- 
ability of arousal to previously neutral stimuli 
during sleep. Whether one wishes to con- 
ceptualize the observable difference of re- 
sponse to highly relevant personal material 
(such as one’s name) as due to past differ- 
ential affective conditioning or to differences 
in meaning is largely a matter of semantic 
preference. That personally relevant material 
tends to evoke a differential affective re- 
sponse has also been shown in quite a different 
context in the work of Wolff (1943). He 
found that individuals gave a greater emo- 
tional reaction to their own forms of self- 
expression even when these were not recog- 
nized as such. The detection-of-deception 
paradigm used here is one where S’s physio- 
logical response to a particular stimulus is 
compared with that to other similar stimuli. 
In past research (Gustafson & Orne, 1963), 
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it has been shown that previously neutral 
stimuli can be made relevant to the experi- 
mental situation and yield differentially 
greater physiological responses. The previous 
work by both Oswald and Wolff has shown 
that personally relevant stimuli yield greater 
affective responses than neutral stimuli in a 
situation other than a detection-of-deception 
experiment. The authors of this study would 
interpret their findings to suggest that the two 
sources of increased physiological respon- 
sivity, situational and preexisting personal 
relevance, may each independently contribute 
to detection. 

The Ss rated themselves as more detectable 
on personal than on code material in the post- 
experimental inquiry. A clear interpretation 
of this finding is difficult. They may have 
perceived themselves as responding to per- 
sonal items during the polygraph examination 
or they may merely have expected that per- 
sonal material should be more detectable than 
code words. Further research is needed to 
clarify the results. 

In designing the present study, it was 
hoped that the nonaware condition would 
allow a comparison between personal- and 
code-word stimuli in a situation where, as far 
as the S knew, a physiological response would 
not have the consequence of detection. Under 
these circumstances one might expect that 
the code words would fail to yield a greater 
physiological response than their appropriate 
control stimuli, whereas this might not be 
the case with idiosyncratic or personal ma- 
terial. This difference in relative response 
to the two classes of stimuli might become 
smaller or disappear entirely under the aware 
condition of interrogation, since at that time 
the S would be motivated to attempt to con- 
ceal his physiological responses to both code 
words and personally relevant words. 

The actual findings indicated no significant 
differences between these two conditions. 
Personally relevant words yielded differ- 
entially greater physiological response than 
code words in both conditions; however, 
code words themselves were identified as 
readily in their nonaware condition as in their 
aware condition. It is of course possible that 
Ss did not believe appearances and thought 
that somehow they might still be monitored 
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(as indeed they were) during the nonaware 
condition, Not only was every possible pre- 
caution taken to minimize suspicion, but very 
careful postexperimental inquiry was carried 
out on this issue. Only one S appeared to 
have any suspicion in this regard; most Ss, 
however, verbalized feelings of discomfort 
during the nonaware condition. They in- 
dicated they felt strange having all the elec- 
trodes attached to them and anticipating the 
forthcoming polygraph examination. Some Ss 
even indicated that if they had been recorded 
during this initial period detection probably 
would have been easier than during the actual 
examination. It would seem that having a 
great many electrodes attached and expecting 
the forthcoming polygraph examination 
created a condition for most of the Ss that 
was similar to that of the actual interrogation. 
It should be emphasized that Ss did attempt 
to deceive E during the preliminary part of 
the examination. Further work will be needed 
to clarify whether the augmented GSR as- 
sociated with the code words under the non- 
aware condition was a function of the S’s 
motivation to deceive E independent of his 
belief that records were being obtained or 
whether it was related to the number of 
electrodes that were attached to S and the 
special situation in which he found himself. 
A definitive answer about the effect of the 
awareness of being monitored on physiological 
responsivity will require considerably more 
investigation, and great care should be taken 
in generalizing from the findings reported 
here. 
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OCCUPATIONAL VALIDITY OF THE GENERAL 
APTITUDE TEST BATTERY 
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20 yr. of occupational validation on the General Aptitude Test Battery (GATB) 
are summarized in terms of average validity with (a) job proficiency and train- 
ing criteria and (b) concurrent and longitudinal studies. These 4 categories of 
studies are investigated from the standpoint of both single and multiaptitude 
prediction. The relative efficiency of many of the 9 GATB aptitudes in pre- 
dicting criteria is dependent on whether job proficiency or training criteria are 
used. Longitudinal studies tend to have higher-aptitude criterion correlations 
than concurrent studies. The median validity of batteries of GATB aptitudes 
tends to be higher with studies using training criteria or the longitudinal design 
than the validity of batteries using job-proficiency criteria or the concurrent 
design. The summary is based upon 424 studies involving over 25,000 em- 


ployees, applicants, trainees, and students. 


In his recent book, Ghiselli (1966) summa- 
rizes the success of researchers in predicting 
occupational performance with aptitude tests. 
Ghiselli’s book provided the impetus for this 
article which summarizes the United States 
Employment Service (USES) research results 
in terms of the average validity of each Gen- 
eral Aptitude Test Battery (GATB) aptitude 
against job proficiency and training criteria 
and in longitudinal and concurrent validation 
studies. The results obtained when batteries 
of GATB aptitudes are developed with each 
of these two types of criteria and two types 
of designs are also compared. The summary 
is based on 424 studies involving over 25,000 
employees, applicants, trainees, and students. 


BACKGROUND 


The USES test research program, initiated 
in 1935, led to the development of the USES 
General Aptitude Test Battery (GATB) pub- 
lished in 1947. Occupational test development 
analysts are studying aptitudinal require- 
ments, as shown by the GATB, for successful 
on-the-job performance in 36 state employ- 
ment services affiliated with the USES. To 
date, GATB validity studies have been con- 
ducted on over 500 occupations. Although 
USES aptitude test research has taken several 
new directions in recent years (Droege, 1966; 
Dvorak, 1965; Dvorak, Droege, & Seiler, 
1965), such as the development of literacy 
skills and nonreading aptitude tests, and 
achievement and proficiency tests are being 


developed, the validation of the GATB for 
specific occupations is still the major com- 
ponent of the USES test research program. 

The nine aptitudes measured by the GATB 
are General Learning Ability (G), Verbal Ap- 
titude (V), Numerical Aptitude (N), Spatial 
Aptitude (S), Form Perception (P), Clerical 
Perception (Q), Motor Coordination (K), 
Finger Dexterity (F), and Manual Dexterity 
(M). Each aptitude is scaled to a standard- 
ized mean of 100 and standard deviation of 
20 based on a general working population 
sample (United States Department of Labor, 
1967). 

Each of the research studies summarized 
below resulted in a specific occupational test 
battery composed of minimum cutting scores 
on two-four GATB aptitudes. These test bat 
teries are currently in use by approximately 
1,600 local offices of the state employment 
services. (Information on each of these studies 
as well as a historical and technical descrip- 
tion of the GATB are available elsewhere 
[United States Department of Labor, 1967]-) 
Data on: (a) older studies which have been 
replaced by new validations, (5) studies 
which did not yield norms, and (c) studies 
which yielded norms considered suitable for 
single plant use only (because of small sample 
size) are not included in this summary. 5 
is estimated that 10% of USES studies f 
to yield norms.) While USES studies hav? 
employed numerous types of criteria, rating 
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by supervisors or instructors have been used 
most frequently. 


PROCEDURE AND RESULTS 
Job Proficiency versus Training Criteria 


The median correlation was computed for 
each aptitude for the 74 USES studies in 
which a training criterion had been used. 
Similarly, the median correlation was com- 
puted for each aptitude in a randomly se- 
lected 20% sample of the 345 USES studies 
in which a job-proficiency criterion was used. 
Grand medians were computed for each type 
of criterion. These results are shown in 
Table 1. 

Table 1 shows that the typical aptitude- 
criterion correlation obtained in USES studies 
is approximately .22 regardless of the type 
of criterion used. However, Table 1 also 
shows that the cognitive aptitudes (G, V, N, 
S) typically correlate to a higher degree with 
training criteria than they do with job-pro- 
ficiency criteria. The additional amount of 
training variance accounted for is greatest for 
Aptitude G (approximately 9%) and least 
for Aptitude S (approximately 3%), with 
Aptitudes V and N showing about the same 
additional amount (approximately 5%). On 
the other hand, the manipulative aptitudes 
(K, F, M) tend to correlate lower with train- 
ing criteria than with job-proficiency cri- 
teria. These correlations reflect differences in 
the amount of criterion variance accounted 
for of 24% (Aptitude F) and 34% (Ap- 
titudes K and M) in favor of the job pro- 
ficiency. The perceptual aptitudes (P and Q) 
tend to have above the same level of validity 
With either job-proficiency or training criteria. 
(It should be noted that all GATB aptitudes 
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are typically included in USES research re- 
gardless of the type of criterion.) 


Concurrent versus Longitudinal Validity 


Ghiselli's calculation of the average valid- 
ities obtained with job-proficiency and train- 
ing criteria was only an aside in his attempt 
to summarize the predictive power of differ- 
ent categories of aptitude tests and the pre- 
dictability of different categories of jobs. 
However, it would have been interesting if he 
had calculated the average validity of another 
classification of studies—concurrent versus 
longitudinal. Longitudinal validation is usu- 
ally held to be preferable from the standpoint 
of experimental design because it yields “pre- 
dictive validity,” but because of practical con- 
siderations much of the USES research and 
much of the occupational validation done else- 
where is on a concurrent basis. 

The concurrent-longitudinal validity com- 
parison is related to the job-proficiency—train- 
ing criteria comparison because in USES re- 
search many studies utilizing training criteria 
are longitudinal. However, studies utilizing 
job-proficiency criteria are both concurrent 
and longitudinal. The median correlation was 
computed for each aptitude for the 72 USES 
studies conducted according to a longitudinal 
research design. Similarly, the median cor- 
relation was computed for each aptitude for 
a randomly selected 20% sample of the 353 
USES studies conducted according to the con- 
current research design. These results and the 
associated grand medians are reported in 
Table 2. 

Table 2 shows that the typical aptitude- 
criterion correlation obtained in USES longi- 
tudinal studies is approximately .22 while the 


TABLE 1 
Mepran VALDITIES or GATB APTITUDES 


Type of criterion | No. studies 


e 
Job proficienc 64-708» 25 19 21 
Training x 67-715 39 | 30 | 31 


Aptitude 


Grand 
Mdn 
8 P Q K F M 
17 23 20 23 22 23 22 
25 22 22 14 16 14 22 


4 20 differ slightly by aptitude. 


» 20% sample of all studies using a job-proficiency criterion. 
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TABLE 2 
Mt VaLmrres or GATB APTITUDES 
— = — = — c — 
| Aptitude 1 
Design of study No. studies — — — - = 
| 6 |x S sler ARAE 
Concurrent 66-69%-b 27 22 24 18 15 21 | 16 19 | 20 | 20 
Longitudinal 71-72" 34 22 28 26 27 22 | 19 A | 22 22 
8 Ns differ slightly by aptitude. 
> 20% sample of all concurrent studies. 


typical correlation is .20 in concurrent studies. 
Eight of the nine aptitudes also show slightly 
higher validities with the longitudinal design, 
while there is no difference on Aptitude V. 
The largest differences in Table 2 indicate that 
an additional 5% (Aptitude P), 34% (Ap- 
titude S), and 44% (Aptitude G) of the 
criterion variance is predicted in longitudinal 
studies than is predicted in concurrent studies. 
The slightly higher validities obtained in 
longitudinal studies are not due to a greater 
range of talent in this group as the medians 
of the aptitude standard deviations in the 
studies shown in Table 2 are either the same 
(S, K, F) or 1 point higher for the concurrent 
studies. The grand median standard devia- 
tions are .17 for concurrent studies and .16 
for longitudinal studies. 


Predictor Combinations 


One consideration which Ghiselli omitted 
was the average validity obtained when sev- 
eral predictors are used. In the typical study, 
the multiple correlation technique is used to 
determine the best possible weighting of pre- 
dictors. As pointed out by Dvorak (1956), 
the USES considers the multiple cutoff method 
to be superior to the multiple correlation 
method for USES occupational validation. 
Since that time USES has begun using the 
phi coefficient rather than the tetrachoric 
correlation to report the relationship between 
the variable dichotomies (high“ or “low” 
on the criterion versus “pass” or “fail” on a 
battery consisting of minimum qualifying 
scores on two-four aptitudes). 

Table 3 shows the median phi coefficients 
obtained in USES studies when divided into 
the following categories: job proficiency and 
training, and concurrent and longitudinal. 


The difference between the median phi coef- 
ficient for the concurrent type of study (.40) 
and the longitudinal type of study (.45) is 
greater than the difference between job- pro- 
ficiency criterion studies (.40) and training 
criterion studies (.42). These results are gen- 
erally consistent with those shown in Tables 
1 and 2. 


Additional Analyses 


Two preliminary analyses were performed 
in an attempt to relate USES results to differ- 
ences between studies with job proficiency and 
training criteria reported by Ghiselli. 

Ghiselli reports grand average validities of 
10 with job-proficiency criteria and 30 with 
training criteria. Ghiselli’s figures appear to 
be based upon average correlations obtained 
in 107 matched sets of studies in which both 
job-proficiency and training criteria had been 
collected. The few USES batteries in whi 
both job-proficiency and training criteria had 
been collected on the same occupation (If 
matched sets) were similarly summarized 
The results showed a median correlation of 2 
for all aptitudes with job-proficiency crit 
(97 correlations) and .26 with training cri- 


TABLE 3 
Mep1an Varmirty or Batterms or GATB A! 
Type of study Mdn phi coefficient 
Job proficiency (N = 345) 40 
Training (N = 74) 42 
Concurrent (N = 353) 40 b“ 
Longitudinal (V = 71) 45 i 


Note.—Five studies in which a combined job-profici j 
training criterion was used were eliminated from tl 5 
proficiency-training comparison but included in the ¢ 
rent-longitudinal comparison. s 
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teria (93 correlations). Although the difer- 
ence between the two median validities is 
slight, it is in the same direction as that re- 
ported by Ghiselli. 

Before all of the data upon which the 
medians reported in Table 1 and 2 were 
readily available, an exploratory study was 
conducted. In this study, the highest of the 
nine aptitude-criterion correlations obtained in 
each of 282 USES studies with a job-pro- 
ficiency criterion and 48 studies with a train- 
ing criterion (a total of 330 studies) was 
singled out and the median of these distribu- 
tions was obtained as a method for comparing 
training and a job-proficiency criteria. The re- 
sults showed a higher median validity for 
studies with a training criterion (44) than 
for studies with a job-proficiency criterion 
(.38). A parallel study was conducted with 
these data to compare median correlations 
from concurrent and longitudinal studies. 
Similar results were obtained. (Longitudinal 
studies yielded the higher validity.) 


CONCLUSIONS 

Cognitive aptitudes (G, V, N, S) are more 
useful for predicting training criteria than for 
predicting job-proficiency criteria. Manipula- 
tive aptitudes (K, F, M) are more useful for 
predicting job-proficiency criteria. However, 
considering the grand median validity for all 
aptitudes, the same level of prediction (.22) 
is obtained regardless of the type of criterion. 
The difference in the types of criteria which 
the cognitive and manipulative aptitudes 
predict best points to the need for a mul- 
tiple-hurdle criterion for many validation 
studies. Different aptitudes (or other pre- 
dictors) may be correlated with success in the 
book-learning and the work performance 
aspects of learning the same job. A multiple- 
hurdle criterion has been used in some USES 
studies where it was recognized that the job- 
performance criterion could not be considered 
the sole measure of occupational success. 

Most aptitudes (especially G, S, and P) 
tend to yield better validity in longitudinal 
studies than in concurrent studies. This may 
mean that validity coefficients based upon 
concurrent studies tend to give underestimates 
of the predictive validity of a test used in 
selection, ignoring of course the possibility of 


time, the range of talent in the research sam- 
ple may be no different than that which would 
be 


sooner, concurrent study. Procedures are being 
instituted in USES research to try to identify 
individuals who do not remain in the research 
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sample to applicant sample. 
Plans are now being made for additional 
analyses of USES data which will be similar 


1. Methods of criterion improvement (about 
which much is said but little is done). 

2. Use of moderator variables (recent work 
by Hobert & Dunnette, 1967, is encouraging), 

3. Use of variables such as interest tests 
and biographical inventories in addition to 
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aptitude tests in the prediction equation (in- 
terest tests offer a possibility for a significant 
and early increase in validity coefficients only 
if a systematic and acceptable way of using 
interest tests in routine selection situations 
can be derived). 


None of the three areas of research men- 
tioned above is new or unexplored, but they 
have not received anywhere near the same at- 
tention as aptitude test research either by 
USES or other researchers. USES is, now 
moving into a research program which will at- 
tempt to improve criteria and explore the use 
of moderator variables and nonaptitude pre- 
dictors. 


TECHNICAL NOTES 


The median correlation is used to summa- 
rize the USES data rather than the weighted 
numerical average used by Ghiselli because 
of the computational problem involved in 
transforming phi coefficients for purposes of 
averaging. However, the weighted numerical 
average was computed for a number of sub- 
samples of aptitude-criterion correlations and 
was found to be approximately the same as 
the median. 

Although a few studies are based on sam- 
ples of less than 50 individuals (older studies 
or cross-validation studies) and a few studies 
are based on samples of several hundred in- 
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dividuals, the majority of the studies (about 
80%) are based on Ns of 50-75. The bound- 
aries of this range are dictated at the lower 
end by a long-standing USES requirement 
that norms must be based on samples of at 
least 50 in order to be used nationally and at 
the upper end by the practical problem of 
trying to find more than 75 people performing 
one specific job. 
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SCALES FOR RATING THE TASTE OF WATER 


WILLIAM H. BRUVOLD 
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Hedonic, quality, action tendency, and quality-action tendency rating scales, 
each designed to evaluate the taste of domestic water, were constrected. Near 
linear relations were found between equal appearing and secceuive leterval 
scale values for the items each scale. Average relishility coefficients 
for scales ranged from .62 to 77 in the laboratory work here reported. Mean 
scale ratings of sodium solutions were found to differ significantly by anion and 
by concentration level on each scale. In consumer survey research, the quality 


rating scale was found to correlate A) with previously developed attitude 
scales, and to be significantly and linearly related to the total amount of com- 
mon mineral present in a water supply. Use of the rating scales to develop 
limiting standards for minerals, and for other constituents commonly found 


in domestic water, was discussed. 


A research project has been initiated (Bru- 
vold, Ongerth, & Dillehay, 1967; Dillehay, 
Bruvold, & Siegel, 1967) which is intended 
to produce objective data that can be used to 
recommend limiting standards for common 
mineral content in domestic water. The de- 
sired standards are intended to insure the 
potability of water served to the public. Lack 
of research regarding the relationship be- 
tween amount of common mineral content in 
water and potability has prevented the United 
States Public Health Service (1962), and re- 
sponsible state agencies, from establishing 
definitive standards for these minerals in 
drinking water. The absence of definitive 
standards means that new public water supply 
permits may be issued for unpotable waters. 
This problem is particularly acute in rapidly 
growing population centers located in the 
southwestern United States. Also, the absence 
of the needed standards serves to impede the 
action necessary to provide all consumers with 
fully potable water for daily consumption. 

_ The present report outlines the construc- 
tion and evaluation of four rating scales that 
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can be used to describe quantitatively the re- 
lation between common minerals, or other dis- 
solved constituents, and the taste of water. 
Three of the scales were new, and one repre- 
sented a refinement of a scale used in an 
earlier work (Bruvold & Pangborn, 1966). 
Specifically, the present paper has three major 
aims. The first is to describe procedures used 
to construct the four scales, The second is to 
report results from a laboratory study de- 
signed to evaluate the four scales. The third 
is to report results of consumer surveys de- 
signed to evaluate the quality rating scale. 


Scare Construction 
Combination Scale 


A rating scale constructed by rank ordering 
statements used by Ss to describe the taste 
of water in a free response setting was em- 
ployed in initial laboratory research (Bruvold 
& Pangborn, 1966). This rating scale con- 
tained 14 items, each item consisting of a 
quality statement in combination with an 
action tendency statement. These items were 
the first to be scaled. 


Method. The 14 combination items were randomly 
arranged on a rating form with 81 additional items 
describing the taste of water in general terms. All 
95 items were presented as the third part of a 
mimeographed booklet. The first part of the booklet 
contained 105 adjectives evaluating the taste of 
water, while the second part contained 73 statements 
generally evaluating water quality. The three sets of 
items, minus the combination statements dealt with 
here, formed the item pools from which three at- 
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titude scales have been constructed (Dillehay et al., 
1967). 

The items were judged by 44 Ss (Dillehay et al., 
1967). Judgments were made on the usual 11-cate- 
gory scale employed in the method of equal ap- 
pearing intervals; standard instructions associated 
with this method were used (Edwards, 1957). All 
Ss judged all items in the same order. 

Results. The median category assignments 
and Q values were determined for each com- 
bination item using standard analytical pro- 
cedures (Edwards, 1957). Also, means and 
standard deviations for each item were de- 
termined by computer using the iterative 
least-squares method of successive intervals 
developed by Diederich, Messick, and Tucker 
(1957). Weights of 2, 1, and 0 were employed 
in the latter analysis with cutoff points of 
0.7 and +1.4. The relationship, over all 
items, for equal appearing and successive in- 
terval values, was very nearly linear. 

Nine items were selected from the total 
group so that scale distances between succes- 
sive items were approximately equal. Items 
forming the combination scale are shown in 
Table 1; scale and dispersion values are 
listed in the last two columns of Table 2. 


New Scales 


Three new rating scales were constructed 
after the work on the combination scale had 
been completed. One scale contained hedonic 
statements, one quality statements, and one 
action tendency statements. The three scales 
constructed were analogous to those described 
by Schutz (1965) for use with food products. 
The procedures employed to rate statements 
were those suggested by Torgerson (1958) in 
a discussion of subjective estimate methods. 


Method. All items were phrased to refer directly 
to water. Nine hedonic items were constructed by 
rephrasing items from the scale used by Schutz 
(1965) ; 23 new hedonic items were also constructed. 
Thirty-four quality and 18 action tendency items 
were constructed using the same free responses to 
water samples employed to develop the original 
combination scale. An attempt was made to cover 
systematically the entire range of reaction to the 
taste of water in preparing each group of items. 

All items from each group were typed, one to a 
card on small plain white cards and labeled, at ran- 
dom, with two digit numbers. The three groups of 
cards were presented, in a random order, to 53 Ss, 
all of whom were employees of the California State 
Department of Public Health. The Ss placed each 
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item into 1 of 11 categories. Standard equal appear- 
ing interval instructions were used. The Ss worked 
alone to complete the judgment task after receiving 
appropriate instructions. Item order was newly ran- 
domized for each group of cards before an S per- 
formed the judgment task. 

Results. The judgments of three Ss were 
discarded due to an apparent reversal of the 
prescribed method of assigning statements to 
categories. Median and Q values (Edwards, 
1957), and means and standard deviations 
(Diederich et al., 1957), were determined 
separately for each set of items employing 
the judgments of the remaining 50 Ss. Weights 
of 2, 1, and O were used with cutoff points 
of +0.8 and +1.6 in the successive intervals 
analysis. Plots made separately for each set 
of items showed near linear relations between 
successive and equal appearing interval values 
in all three cases. The plot for hedonic items 
is shown in Figure 1. Median, mean, Q, and 
standard deviation values for all 98 statements 
are available (Bruvold, 1967). 

Quality and action tendency scales were 
constructed to provide approximately equal 
distances between items. Items for the hedonic 
scale were selected to match those forming 
the scale commonly used in the sensory evalua- 
tion of foods (Schutz, 1965) even though the 
distance between successive items was not 
approximately equal. The items comprising 
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Fic. 1. Plot of equal appearing and successive ma 
terval scale values for standard hedonic items. 
(Circles represent Jones and Thurstone, 1955, data. 
Filled squares represent items chosen for the water 
taste hedonic scale. The lines were fitted by eye.) 
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TABLE 1 


Hedonic scale 


. Llike this water extremely. 

. I like this water very much. 
like this water moderately. 

. I like this water slightly. 

. I neither like nor dislike this water. 
. I dislike this water slightly. 

I dislike this water moderately. 

. I dislike this water very much. 

. I dislike this water extremely. 


Action Tendency scale 


1. I would be very happy to accept this water as my 
everyday drinking water. 


2. I would be happy to accept this water as my 


everyday drinking water. 


3. Iam sure that I could accept this water as my 


everyday drinking water. 


4. I could accept this water as my everyday 


drinking water. 


5. Maybe I could accept this water as my everyday 


drinking water. 

6. I don’t think I could accept this water as my 
everyday drinking water. 

7. I could not accept this water as my everyday 
drinking water. 

8. I could never drink this water. 

9. I can’t stand this water in my mouth and I 
could never drink it. 


each scale are shown in Table 1. Scale and 
dispersion values are listed in Table 2. 


LABORATORY STUDY 


Research in a laboratory setting was per- 
formed with the four scales in order to ob- 


9. This water has a horrible taste 


Ixus rox tre Four Taste Scates 


2. This water has a very good taste. 

J. This water has a good taste. 

4. This water has a slightly good taste, 
S. This water has a neutral taste 

6. This water has a slightly bad taste 
7. This water has a bad taste. 

8. This water has a very bad taste. 


Combination scale 


1. This water tastes real good. I would be very happy 
to have it for my everyday drinking water. 

2. This water tastes good. I would be happy to have 
it for my everyday drinking water. 

3. This water has no special taste at all. I would be 
happy to have it for my everyday drinking water. 

4. This water seems to have a little taste. I would be 
satisfied to have it for my everyday drinking water. 

5. This water has a mild bad taste. I could accept it 
as my everyday drinking water. 

6. This water has a fairly bad taste. I think I could 
accept it as my everyday drinking water. 

7. This water has a bad taste. I don’t think I could 
accept it as my everyday drinking water. 

8. This water has a real bad taste. I don’t think 
I would ever drink it. 

9. This water has a terrible taste. I would never 
drink it. 


tain an assessment of the reliability of each 
rating scale and to compare, across scales, the 
magnitude of mean reliability coefficients. 
Also the research was designed to yield some 
information regarding the validity of the 
scales. In general, judges should rate synthetic 


TABLE 2 
SCALE AND DISPERSION VALUES FOR SELECTED Taste SCALE ITEMS 


Hedonic scale 


Item no. EAI SI 


El 


1 10.66 1.02 1.71 | .23 | 10.67 | 1.16 
2 9.94 1.40 1.48 | .34 | 9.79 | 1.31 
3 7.50 | 1.52 38 150 | 8.45 | 1.44 
4 6.98 | .60 04 23 | 7.16 -74 
5 5.98 54 —.20| .05 | 6.01 | -56 
6 4.87 6 —.42| .11 | 4.64 113 
7 3.98 | 1.02] —.56 | .17 | 2.95 
8 1.86 1.20 —1.38 | 51 | 2.05 1,22 
9 1.24 | 1.07 | —1.98 | .84 1.16 


a Equal appearing intervals. 
b Successive intervals. 


Combination scale 


EAL S* 
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1. This water has an excellent taste. 
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water samples containing a lesser amount of 
solute more favorably than samples contain- 
ing a greater amount of the same solute 
(Bruvold & Pangborn, 1966). Further, it is 
expected that ratings of equal concentration 
sodium solutions should vary significantly by 
anion, with sulfate and bicarbonate solutions 
receiving the most favorable ratings, chloride 
intermediate ratings, and carbonate the least 
favorable ratings (Bruvold & Pangborn, 
1966). 


Method. Twenty-four individuals served volun- 
tarily as Ss in the laboratory study. All were em- 
ployees of the California State Department of Public 
Health. Twelve Ss were male and 12 were female. 
Some Ss had participated in earlier research which 
involved rating the taste of water samples. None, 
however, was asked to serve in the present study on 
the basis of quality of performance in the earlier 
work. No S was highly experienced in the sensory 
evaluation of taste, odor, or flavor. 

The nine statements comprising a scale appeared, 
unnumbered, along the left-hand edge of a mimeo- 
graphed data sheet. The statements were ordered 
from favorable to unfavorable, starting with the 
uppermost statement, To the right of each statement 
there appeared a row of 10 cross marks. The nine 
rows of 10 cross marks each were also ordered by 
column to correspond to 10 water samples labeled 
by code letters. Instructions for the rating procedure 
appeared at the top of each mimeographed data 
sheet. 

Ten water samples were used in the laboratory 
work. Double distilled water was used as one sample 
as was Berkeley tap water. The latter is an odor- 
free water containing approximately 85 mg/l of 
total dissolved solids (TDS). Solutions prepared in 
the laboratory comprised the eight remaining water 
samples. Reagent grade NaHCOs, NasSO., NaCl, and 
Na:COs were used as solutes, and double distilled 
water as the solvent, in preparing solutions of 1,000 
and 2,000 mg/l. Each solute was represented at each 
concentration level. 

Each S took part in nine separate rating sessions. 
Successive sessions for each S were separated by 1 
wk. The first session was used only as a practice 
session. The remaining eight sessions were divided 
into two blocks of four sessions each. Each scale was 
used once by each S during each block. A list of the 
24 possible orders of presentation for the four scales 
was tabulated. The order in which Ss used the scales 
was determined by random matching of Ss and 
orders separately for Block 1 and for Block 2. The 
practice session involved the same scale as Session 
1 of Block 1. The average number of interpolated 
sessions intervening between like scales, over all Ss, 
was 3.00 for the hedonic scale, 3.00 for the quality 
scale, 3.08 for the action tendency scale, and 2.92 for 
the combination scale. 

The actual rating task required S to take three 
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TABLE 3 


RELIABILITY COEFFICIENTS FOR THE 
Four Taste SCALES 


Frequencies 
Reliability = i 
coefficient ; 8 Action | Combi- 
pigs eral Tendency| nation 
Peale 1 scale scale 
90.99 ii 2 1 2 
80-89 4 6 4 8 
70.79 5 8 5 9 
60.69 4 4 4 + 
50.59 5 2 5 1 
40-.49 3 1 1 0 
-00-.39 2 1 4 0 


separate mouthfuls of each sample, to record his 
rating on the data sheet by circling the appropriate 
cross mark, to rinse his mouth thoroughly with 
Berkeley tap water three times, and to rest for 30 
sec. before rating the next sample. The instruc- 
tions were read to each S at the start of the practice 
session by the experimenter who remained with S 
throughout this initial session to see that all pro- 
cedures were correctly followed and that the 10 
samples were rated in the order specified. The Ss 
worked alone in all remaining sessions. 

The rating was conducted in a small, quiet, air- 
conditioned room where the temperature was main- 
tained at 70 +2°F, All water samples were served 
at room temperature in letter-coded 100-ml. beakers 
filled to the 75-ml. level. Order of sample presenta- 
tion was randomized for each individual rating ses- 
sion. A different coding system was used for cach of 
the 9 wk. of the study. 


Results. All analyses performed upon the 
laboratory results, used equal appearing in- 
terval scale values as raw data. Standard 
integer values of one through nine were not 
employed because of inequality of item spac- 
ing especially evident in the hedonic scale. 

Pearson product-moment correlation coef- 
ficients were computed for all combinations 


TABLE 4 


AVERAGE INTERCORRELATIONS BETWEEN THE 
Four RATING SCALES 


i bi- 
Scale | Hedonic | Quality fe eee ae 
Hedonic 63 
Quality .70 73 
Action 
Tendency 66 69 62 
Combination 10 72 HL 0 
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TABLE 5 
x pi MEAN RATINGS FOR War Saurtes Usen iw tax Lasoratosy Sropy 
Solution ANEN Hedonic scale | Quality scale | 1 
Distilled nil 71 1.37 7. =" eon 
Berkeley tap 85 7.59 7.67 8.36 733 
Na:SOx 1,000 6.07 5.79 688 6.08 
NaHCO; 1,000 5.40 515 5.79 536 
NaCl 1,000 4.85 449 543 S33 
Na:COs 1,000 334 3.11 3.58 3.0 
NaSO, 2,000 5.23 480 5.90 s3 
NaHCO; 2,000 4.68 4.70 82 4.65 
NaC1 2,000 3.73 3.61 3.98 3.77 
Na,CO; 2,000 1.82 1.81 195 | 214 


of scales and sessions separately for each S. 
Thus, 28 correlation coefficients were obtained 
per S. Each correlation coefficient describes 
the relation between ratings given identical 
water samples for a particular pair of rating 
sessions. Of the 28 correlation coefficients, 4 
involve like scales, and therefore these coef- 
ficients may be taken as measures of the con- 
sistency or reliability of rating by a single 
S. The reliability coefficients for all Ss and 
scales are reported in Table 3. Individual re- 
liability coefficients were transformed from r 
to z values and then subjected to a Treat- 
ment (Scales) & Subjects analysis of vari- 
ance. An F ratio of 4.48 (df = 3/69) was 
obtained for scales. This ratio is significant at 
the .05 level for a conservative test using 1 
and 23 degrees of freedom (Winer, 1962). 
The average intercorrelations between scales, 
over all sessions and Ss, are shown in Table 4. 

Mean ratings averaged over sessions and 
Ss for each sample and each scale are shown 
in Table 5. The results of repeated-measures 
analyses of variance (Winer, 1962), analyses 
using each S’s average rating for each water 
sample over the two sessions involving like 
scales as raw data, are shown in Table 6. 
Sex of S, solute concentration, and type of 
Mineral solute formed the three factors in these 
analyses, Ratings for distilled and Berkeley 
tap water were not included in the analyses 
of variance. 


CONSUMER SURVEYS 


The rating scales developed can be used by 
Ss to rate water samples in a laboratory set- 


ting, and they can also be used by consumers 
to rate the taste of water received for daily 
consumption. The consumer research was de- 
signed to provide an assessment of the re- 
liability of the quality rating scale by cor- 
relating results from it with attitude scale 
scores. Further, the surveys were designed to 
provide some evaluation of the validity of the 
quality scale by correlating scale ratings with 
TDS in the waters studied. Theoretical no- 
tions concerning consumer attitudes toward 
water clearly indicate that there should be an 
imperfect, but substantial, negative correla- 
tion between TDS and quality scale ratings 
(Dillehay et al., 1967). 

Method. Four water systems serving communities 


in the Central California coastal area were selected 
as testing sites for the quality rating scale. Each 


TABLE 6 
ANALYSES OF VARIANCE FOR LABORATORY RATINGS 


Action 
Source | df | MSPS" | ORS | Tendency 


S A 1 0.28 0.40 1.30 
S 2 groups 22 3.16 1,82 661 
Sor at 1 | 53.47% | 30.13% | 64.03% 
AXB 70 0.16 1.47 1.61 
hi 
R A 22 0.51 1.78 1.18 
inerals 3 | 84.30% | 76.69% | 114.91% 
Mi c © 3| 0.39 1.59 0.52 
hi 
E ere n 66 1.30 1.60 2.17 
Cc 3 1.47 2.14 2.82% 
A ¥ BXC 3 0.79 134 1.66 
CXS 
2 . 66 0.95 0.94 1.02 
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TABLE 7 
Dara ror THE FOUR Co SURVEYED 
Midrange mineral content in mg/l 
Water wi T Mean | Mean | Mean No. re 
ATS AATS TSR vondent 
eee TDS | Ca | Mg | Na HCO, ci No, So. pi 
— — —— |—- |  - 2 
Los Osos 123 5 5 17 32 26 9 3 | 884 | 8.74 | 9.80 54} 
San Miguel 585 41 42 82 286 73 14 111 | 7.64 | 7.37 8.51 55 
El Rio 1,001 140 53 80 266 45 12 438 | 7.28 | 6.89 | 7.42 59 
Guadalupe | 2,078 | 287 128 122 288 154 14 1,016 | 5.26 | 5.16 | 5.23 | Si* 


| 
Í 
i 


One respondent's TSR missing. 


system surveyed was supplied by odor-free water 
sources as determined by on-site odor detection 
threshold tests (American Public Health Association, 
1965), and no source contained iron or manganese 
in amounts greater than .05 mg/l. No system chlo- 
rinated, and all were supplied by wells whose com- 
mon mineral content was relatively constant over the 
system and across the seasons. Chemical analyses and 
other data for each community water system sur- 
veyed are shown in Table 7. 

Selection of respondents from each community was 
accomplished by a careful sampling procedure that 
involved listing all dwelling units within the geo- 
graphical area served by the water system under 
study, the selection of approximately 60 dwelling 
units by a random sampling procedure modified to 
distribute selected dwelling units uniformly over the 
area, and, finally, the random selection of one Eng- 
lish-reading individual 15 yr. of age, or older, from 
all such individuals residing at each selected dwelling 
unit. Each respondent participating in the survey 
was first given an interview, then three attitude 
scales (Dillehay et al., 1967), and finally the quality 
rating scale. The interview and scales were ad- 
ministered at the respondent’s home by a trained 
interviewer working for the California State Depart- 
ment of Public Health. 

Each participating respondent drew a glass of water 
from his kitchen tap before the taste scale rating 
procedure was initiated. Having obtained a glass of 
water, the respondent was given a copy of the qual- 
ity scale in which the nine items appeared unnum- 
bered and ordered from most to least favorable. 
The interviewer read each item comprising the scale 
to the respondent. The respondent was then in- 
structed to take three mouthfuls of water from the 


TABLE 8 


INTERCORRELATIONS BETWEEN TDS AND 
CONSUMER SCALE SCORES 


AATS ATS TSR 
ATS -93 
TSR 83 84 
TDS —.6³ —.67 EL 


glass, to study the scale items carefully between 
mouthfuls, and to indicate, after taking the third 
mouthful, which statement most accurately expressed 
his rating of the taste of the water. 


Results. In El Rio, 100% of the selected 
respondents participated in the survey. Anal- 
ogous percentages were 93, 95, and 95, respec- 
tively, for San Miguel, Los Osos, and Guada- 
lupe. 

Intercorrelations between quality taste scale 
ratings (TSR), attitude taste scale scores 
(ATS), attitude adjective taste scale scores 
(AATS), and TDS as milligrams per liter are 
shown in Table 8. Each correlation coefficient 
is based upon scale responses obtained from 
all four towns. Correlation coefficients for 
taste scale ratings involve 217 respondents, 
while those for attitude scale scores involve 
219 respondents. Equal appearing interval 
scale values were used in all TSR analyses. 

An analysis of linear regression (McNemar, 
1962) was performed for each set of scale 
scores and TDS midranges. The results of 
the analyses of variance are shown in Table 9. 
A plot of TSR, ATS, and AATS means against 


TABLE 9 


ANALYSES OF VARIANCE FOR CONSUMER 
Survey RESULTS 


ATS AATS TSR 
Source 
af | ms | af | ms | af | MS 

Linear * 

regression 1 340.88“ 1 | 333.74* 1 | 573.68 
Deviation from 25 
Dale 2 2.68 n 2) 25 

etween- array 

means 3) 11s.a0¥| 3114819342 
Within arrays | 215 | 1.90 215 2.35 | 213 ae i 
Residual 217 1.91 | 217 2.37 2185 3- 

* p L. 001. 


TSR(Y¥*-0,.00230 X +991) e 
-ATS(Y=-0.00177 X 48.92) è 


à AATS(Y*-0,.00174X+8,69) 


Meon Scole Scores 
o-nus „„ „„ „ 8 


1000 2,000 


TOS in Milligrams per Liter 


Fic. 2. Lines of best fit for TDS midranges and 
mean scale scores. 


TDS midranges, and the line of best fit for 
each scale, is pictured in Figure 2. 


i Discussion 
Scale Construction 


__ Edwards (1957) holds that the relation be- 
_ tween equal appearing and successive interval 
Kale values for the same set of items will be 
linear only in the middle range, with succes- 
ve interval values becoming more extreme 
near the ends of the continuum. 1 
prediction was clearly borne out by the rela- 
tion between successive interval values for 
general hedonic scale items reported by Jones 
and Thurstone (1955) and equal appearing 
interval values for water taste hedonic scale 
items. This result is pictured in Figure 1. 
However, none of the plots relating successive 
and equal appearing interval values for the 
= four sets of items here scaled gave clear sup- 
port to Edwards’ assertion. Instead, the overall 
linearity obtained between equal 
and successive interval values shows that the 
two methods yielded equivalent results for 
he statements scaled in this research. 


Laboratory Study 

The reliability coefficients based upon the 
laboratory work varied substantially actoss 
Ss. Such a result was not unexpected since 9 
bor this study were volunteers not highly ex- 
perienced or skilled in sensory evaluation 
Vork. Nevertheless, 55 of the 96 individual 
| téliability coefficients were equal to, or greater 
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quality or the combination scales 
Schutz (1965) found an action tendency 
scale to be superior to an hedonic Kale in rat- 
ing food products. In terms of reliability, so 
such advantage was found for the action 
tendency scale in the present study. The sta- 
tistically, and practically, significant difer- 
ences between mean reliability coefficients 
were not fully anticipated. The lower relis- 
bility of the action tendency scale may be duc 
to the difficulty an S encounters in predicting 
his actual response to a certain mineralized 
water solely on the basis of laboratory ex- 
posure to that water. The lower reliability of 
the hedonic scale may be due to the rather 
gross inequality of item spacing along this 
continuum. Whatever the cause of the find- 
ings, the results of the reliability study, the 
spacing of items, the relative size of the dis- 
persion values, and simplicity of phrasing, all 
indicate that the quality scale is likely the 
most useful of the four here investigated. This 
judgment must be tempered by the purpose 
of the research in which the scale is to be 
employed, and also, perhaps, by future find- 
ings involving additional assessments of scale 
uality. 
2 The analyses of variance performed upon Sa’ 
average ratings for sodium solutions showed 
that there was a significant difference between 
means associated with sample concentration 
on each of the four scales. This finding is well 
illustrated by Table 5. Each 1,000 mg/l sam- 
ple received a more favorable mean rating 
than the corresponding 2,000 mg/l sample. 
This result supports the validity of the four 
scales as it was expected that general taste 
ity would deteriorate as amount of min- 
eral dissolved in water . 
The analysis of variance on Ss’ average 
ratings also showed that there was a significant 
F ratio associated with solution anions. This 
result parallels earlier findings with the 14- 
item combination scale (Bruvold & Pang- 
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born, 1966). It now seems well established 
that mean taste scale ratings for equal con- 
centration sodium solutions will clearly rank 
by anions, with sulfate and bicarbonate rank- 
ing highest, chloride intermediate, and car- 
bonate lowest. As pointed out in connection 
with the earlier work, this rating is identical 
with the rank order of detection threshold 
values for sodium salts in water (Bruvold & 
Pangborn, 1966). The congruence of present 
findings with earlier ratings and with detection 
threshold results further supports the validity 
of the rating scales here described. 


Consumer Surveys 


The correlations between the quality rating 
scale and the two attitude scales may be con- 
sidered as reliability coefficients since both 
procedures were designed to measure the 
same attribute. Such a reliability coefficient 
is most closely related to a coefficient of 
equivalence obtained by administering equiva- 
lent forms of an instrument in close succes- 
sion. In addition to usual error variance due 
to chance factors and differences in item con- 
tent, the present measure of reliability will be 
affected by differences between the rating and 
attitude scale methods of obtaining an in- 
dividual score. Since the obtained correlation 
coefficients of .83 and .84 are substantial in 
spite of the additional source of unreliability, 
it may be concluded that the quality rating 
scale possesses reliability sufficient for use in 
survey research of the kind reported in this 
paper. Further, it should be noted that the 
correlation between TDS and scale scores was 
of similar magnitude for both techniques of 
measurement. 

Reliability, in the broader sense of con- 
sistency of results, may also be evaluated by 
comparing the regression of rating and at- 
titude scale scores on TDS. The analyses re- 
ported in Figure 2 and Table 9 show that 
means for all three scales were linearly related 
to TDS midranges for towns. The rating 
scale means fit a line of somewhat steeper 
slope than those fitting attitude scale means; 
however, such a difference in slopes would 
not be of much practical importance in recom- 
mending a limiting standard for total mineral 
content in domestic water. The degree of con- 
sistency obtained for the regression results 
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provides further evidence that the rating 
scale possesses reliability sufficient for use in 
consumer survey research. 

The significance of differences between 
mean ratings for scale scores over towns, and 
the significance of the negative slope fitting 
the scale means and TDS midranges, provides 
evidence supporting the hypothesis that there 
is an inverse relationship between favorability 
of consumer attitudes toward water and its 
total common mineral content (Dillehay et 
al., 1967). These results support the validity 
of the rating scale, and provide further sup- 
port for the validity of the attitude scales. 

The linear relation between TDS midranges 
and rating scale scores for the four water sys- 
tems studied may appear to be at variance 
with laboratory work which showed that im- 
portant differences existed between mean rat- 
ings for equal concentration mineral solutions. 
The apparent difficulty is resolved by noting 
that, with the exception of bicarbonate for 
San Miguel and chloride for El Rio, the 
relations between ionic concentrations and 
TDS were approximately linear across towns. 
Nothing in the laboratory work completed to 
date suggests that a nonlinear relation will be 
obtained between TDS and mean taste scale 
ratings when each mineral ion present main- 
tains a relatively constant proportion to TDS. 
It is also worth noting that the more mineral- 
ized natural waters surveyed were relatively 
low in chlorides and high in sulfates. This fact 
probably explains the rather moderate mean 
ratings given El Rio and Guadalupe water. 
Had these two waters been relatively high in 
chlorides and low in sulfates, more negative 
ratings would very probably have been ob- 
tained. 


CONCLUSIONS 


The performance of the quality rating scale 
in the laboratory and consumer survey re- 
search reported here indicates that it may 
now profitably be employed in future work 
designed to recommend limiting standards 
for common mineral content in domestic water 
(Bruvold et al., 1967). Consumer ratings ob- 
tained with the quality scale represent mea- 
surement of the attribute of interest by the 
judgment method (Torgerson, 1958). The 
previously constructed attitude scales (Dil- 
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lehay et al., 1967) obtain a measure of the 
attribute of interest by the response method. 
Use of both judgment and response methods 
in the consumer survey research will give ad- 
ditional confidence to the standards finally 
recommended. 

Taste panel ratings obtained using the 
quality scale and water samples from each 
community surveyed may also be extremely 
useful in establishing the desired standards. 
The taste panel ratings may be considered 
analogous to chemical analyses performed 
upon water samples from a particular com- 
munity in the laboratory. Results from the 
chemical analyses, such as TDS values, are 
employed to describe the relation between 
mineral content in water and potability as 
operationally defined by consumer attitude or 
rating scale scores. If it is found that the 
mean taste panel ratings allow for significant 
improvement in describing the relation be- 
tween consumer responses and results from 
laboratory analyses, taste panel ratings may 
be incorporated into a regression equation 
that would be used to predict the potability of 
mineralized waters not assessed in the con- 
sumer survey work. Here the regression equa- 
tion relating results of laboratory analyses to 
consumer response comes to embody the de- 
sired standard, and, as indicated, inclusion 
of taste panel data in this equation may con- 
siderably increase its accuracy. 

The newly developed attitude and rating 
scales can also be used to assess flavor in 
water produced by dissolved constituents other 
than common minerals. In fact, increasing in- 
terest in defining and protecting water quality 
(Bean, 1962) indicates that considerable re- 
search will be performed in order to establish 
standards for taste and odor-producing con- 
stituents commonly found in water used for 
domestic supply. The scales developed make 
possible an interlocking consumer evaluation- 
taste panel approach to the assessment of 
flavor in domestic water. Results from the 
scales may be used to recommend a limiting 
standard in terms of an upper limit of ac- 
ceptability whenever the threshold approach 
is not feasible (Bruvold et al., 1967). When 


consumers of taste pasel judges will provide 
fuller knowledge of the effects of the cosa. 


centration at which a particular constituent 
in water becomes unacceptable for daily 
drinking. 
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HOW TO SUCCEED IN BUSINESS ACCORDING TO 


BUSINESS STUDENTS AND MANAGERS 


BERNARD M. BASS 


Management Research Center, University of Rochester 


The Organizational Success Questionnaire was developed to see the extent 
graduate business students and middle managers accept the opinions of the 
social theorists and the political theorists about what it takes to succeed in 
large organizations. 6 factors emerged in a Ist analysis but failed to remain 
sufficiently independent. Consequently, a social approach scale was constructed 
with a mean internal consistency reliability of .72 and a mean retest reliability 
of .S2. A political scale yielded similar reliabilitics. Slight but meaningful rela- 
tions were found between scale responses and responses to other inventories of 
orientation and value. Both social and political approaches are needed, fairly 
or very often, for success according to the respondents (the social somewhat 
more than the political). But initially students favored less activity of cither 
kind compared to middle managers, Following training the managers remained 


unchanged but the students became more like the managers. 


When your boss asks you to push your sub- 
ordinates to work harder and your subordi- 
nates complain they already have too much 
to do, what do you do? What do you think 
the successful executive does? Faced by these 
and countless other conflicts of interests 
among those above, below, and next to you 
in the system, do you concentrate on being 
open, trusting, and communicative, as prob- 
ably would be recommended by most socially 
oriented t experts such as Gibb 
(1964), Argyris (1962), or Bennis (1964); 
or, on the contrary, do you become the astute 
dissembling politician, maintaining your dis- 
tance, controlling the outflow of information, 
indulging in minor insincerities, and making 
superficial compromises, as probably would be 
recommended by most politically oriented 

ent experts, such as Martin and 
Sims (1956), Jameson (1945), or Pfiffner 
(1951)? Gibb argues that before effective in- 
terpersonal decision-making procedures can 
be developed that lead to real commitments, 
a trusting relationship must be established 
between the persons. Argyris gives highest 
priority to “owning up” to one’s own feelings 
and needs. On the other hand, Jameson em- 
phasizes that to be effective in such circum- 
stances, one needs to wear a “mask,” while 
Pfiffner calls for hiding one’s own feelings, 
engaging in benign chicanery, and being a 
good actor. The present investigation set out 
to develop a brief attitude scale which would 


make it possible to see how managers and 
students felt about the issue, and what ef- 
fects a particular training program had on 
these attitudes. 


DEVELOPMENT OF THE ORGANIZATIONAL 
Success QuESTIONNAIRE 


As a first step in development of an attitude 
scale, that is, the Organizational Success 
Questionnaire (OSQ), 200 undergraduate stu- 
dents enrolled in an elementary psychology 
section were asked to complete a 16-item 
questionnaire under the following instructions: 

People differ in what they think it takes to get 
ahead today in most large organizations. Listed be- 
low are ways of behaving that may or may not be 
important for success. Indicate how frequently you 
think managers, staff members, or administrators 
ought to behave as described in order to be successful. 

Eight items were composed primarily from 
an article by Martin and Sims (1956) on the 
utility of the political approach and eight 
other statements were drawn from the writings 
of Gibb (1964) advocating the social ap- 
proach. 

A typical item was: 

Successful persons try to establish in advance mu- 
tually satisfactory objectives with others with whom 
they must work. 

5 — always] 

4 — very often 


1The frequency scale was based on a magnitude 
estimation study of 28 adverbs of frequency. Each 
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TABLE 2 
INTERCORRELATIONS AMONG THE Factor SCORES 


Correlation 
VI with same 
factor on 
| retest 
50°* 
— 14° 
—.12 
—.20 
38˙⁰ 


of other assessments. Results are shown in 
Table 3. 

Other assessments. Verbal and numerical 
aptitude were measured on the Admissions 
Test for Graduate Schools of Business. Self-, 
interaction, and task orientation were as- 
sessed by the Orientation Inventory (Bass, 
1962), a Kuder-type inventory of 27 triads. 
Values were assessed by the Allport-Vernon 
Study of Values (Allport, Vernon, & Lindzey, 


1960), and budgeting decisions were examined 
by Exercise Objectives, an exercise in which 
participants each choose whether or how to 
spend money on each of five problems: safety, 
labor relations, management development, 
product development, and stream clean-up. 
For each problem, one could choose a “soft” 
social solution, that is, fix equipment to avoid 
a possible accident, or a “hard” economic 
solution, that is, let the machine operator be- 


TABLE 3 


INTERCORRELATIONS OF OSQ Factor SCORES AND ASSESSMENTS OF ABILITY AND VALUES 


Factor scores 
Assessment 
I II III IV V VI 
Verbal aptitude (ATGSB) .00 05 —.12 —.0¹ 10 06 
Numerical aptitude (ATGSB) .00 —.09 —.24** 03 06 —.03 
Greater human and social con- —.20* A9* 30** 25 —.17* 25% 
cern in five budgeting decisions 
Orientation Inventory 
Self-orientation 14 —.15* —.14 —.24** 05 —.02 
Interaction orientation 06 .08 08 —.03 07 04 
Task orientation —.18* 04 01 .20* —.10 — 01 
Allport-Vernon Study of Values 
Theoretical —.13 Sid —.25** .20* 02 -01 
Economic 15 —.26%* | —15* —.07 08 19" 
Aesthetic 11 08 10 =.05 08 — 13 
Social —.05 10 15* 16* —.09 —.08 
Political 1055 08 —.0⁴ —.00 —.02 AS 
Religious 04 15* 13 00 03 —. 


Note. —Factor I = Emphasize personal gain; II = Emphasize organizational gain; III = Share decision making; IV = Em- 


phasize candor, openness, trust; V = Bluff, 
*p <.05,r = .15 for 111 df. 
*% S. 01, 7 = .22 for 111 df. 


» obscure issues; VI = Maintain social distance and prerogatives. 
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ware. The total number of “Social” choices 
was calculated and correlated with OSQ re 


sponses. 

Several other exercises of the Program of 
Exercises (Bass, 1967) were also scored 
against OSQ responses, but failed to yield 
significant or meaningful patterns of results, 
possibly because of the low reliabilities of the 
factor scores. 

OSQ and aptitude, Numerical aptitude cor- 
related —.24 with sharing. No other sig- 
nificant relations were observed. 

OSQ and orientation, From Table 3 it can 
be seen that selj-orientation on the Orienta- 
tion Inventory the tendency to seek recogni- 
tion and personal rewards and to prefer to 
work alone rather than in groups was nega- 
tively related (—.15, —.14, and —.24) with 
OSQ Factors II, ITI, and IV, concerned with 
emphasizing organizational gain, sharing de- 
cisions, and fostering trust. On the other hand, 
task-orientation—interest in the challenge of 
the work to be done—was positively associated 
(.20) with the OSQ trust score (IV) and also 
unlike gel /- orientation was negatively related 
(—.18) with the OSQ factor score for per- 
sonal gain (I). 

OSO and values. Theoretical value on the 
Allport-Vernon—appreciation of understand- 
ing—was negatively associated (—.25) with 
sharing of decisions (III) but positively 
(20) with trust (IV). Valuing economic ac- 
tivity was positively associated with emphasiz- 
ing social distance (.19) and negatively with 
emphasizing organizational gain (—.26) and 
sharing (—.15). Contrary to , 
strong social values were only slightly related 
to belief in the importance to success of shar- 
ing (.15) and trust (.16), while strong politi- 
cal values were only slightly related to em- 
phasizing personal gain (.19) and social dis- 
tance (.15) and not at all to bluffing. Strong 
religious values correlated .15 with emphasiz- 
ing organizational gain. While relations here 
were in the direction expected, it can be seen 
that the brief OSQ factor scores could in no 
way substitute for the much more reliable All- 
port-Vernon Study of Values and would be 
severely limited to survey and research ap- 
plication. tee 

OSO and budgeting decisions. ini a 
ing are the positive correlations of 30 and .25 


* 


between the tendency to mate “social” boad- 
geting deciiions aad the OSQ Factors III and 


tive correlations of —20, —.17, and —25 
between these “soft” decihiloa tendencies and 
belief in the utility of political approaches to 
organization emphasizing personal gain (1), 
bluffing (V), and social distance (VI). 


Rescoring 


The unreliability of the factor scores, their 
tendency to cluster rather than remain inde- 
pendent, and the patterns of their correlations 
with other assesuments led to the pooling of 
items from the social factor scores into one 
social approach score and to the pooling of 
political factor items into one political ap- 
proach scoret 

Twelve of the original 16 items were clus- 
tered into two composites, a social and a 
political, to form the final scale. 

The four items which were eliminated 
loaded highly on more than one factor or 
loaded only on Factor II dealing with or- 

be 


tively on a factor. 

The final set of 12 items and whether each 
is scored as social (S) or political (P) are 
shown in Figure 1. 


Scale Reliabilities and Intercorrelations 
For a total of 196 cases, test-retest relia- 


i ly more dis- 
inating factor scales was rejected. First, the 
crcl an — dichotomy was in line with the 
rationale with which the author had begun. Second, 
the resulting lengthy questionnaire would be 

more difficult to administer in the future to 


samples. 
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ORGANIZATIONAL SUCCESS QUESTIONNAIRE 


People differ in what they think it takes to get 
ahead today in most large organizations, Listed be- 
low are ways of behaving that may or may not be 
important for success. Indicate how frequently you 
think managers, staff members, or administrators 
ought to behave as described in order to be success- 
ful. 

Use the following scale to indicate your opinion: 


5—They always ought to do this. 
4—They ought to do this sery often. 
3—They ought to do this fairly often. 
2— They ought to do this sometimes, 
1—They seldom ought to do this. 
O—They never ought to do this. 


S 1. Level with others; be open, frank and can- 
did in their communications with others, 
Withhold the release of information or 
time its release for when it will do the most 
good. 

3. Share in decision-making with their sub- 
ordinates whenever possible. 

. Act confidently when they are personally 
unsure about matters or when they lack 
relevant information. 

. Make political alliances with superiors and 

subordinates to foster and protect mutual 
interests. 

. Completely and openly commit themselves 
to a position or program. 

S 7. Try to establish in advance mutually satis- 
factory objectives with others with whom 
they must work. 

. Initiate actions which they are personally 
against but retard and delay carrying out 
the actions so that the actions are in 
progress but never completed. 

P 9. Maintain social distance; remain aloof, de- 
tached, uninvolved with others; always 
remain the boss when interacting with sub- 
ordinates. 

|. Foster mutual trust with others. 

P 11. Openly compromise, yet privately divert or 
delay compromise plans so that their own 
aims will be pursued despite the stated 
compromise, 

S 12. Arrange for their superiors and their sub- 
ordinates to meet together to encourage 
group discussions among others above and 
below them in the organization with easy 
participation by all. 


Fic. 1. Organizational Success Questionnaire with each 
item scored as social (S) or political (P). 


3 


EFFECTS OF ORGANIZATIONAL EXPERIENCE 
Before they entered training programs ? in 
organizational psychology, OSQ data were col- 


3 The training program focused on completing 10 
small group exercises dealing with budgeting, expecta- 
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TABLE 4 


How To SUCCEED IN Business ACCORDINO TO Grane. 
ate Business Stupents AND MippLe Manaorns 
BEFORE AND AFTER TRAINING 


Mean attitude 


Spk Pre- Post- — 
training training Taai 
Social items 
83 graduate business 3.62 3.82 | 3.90 
students (days) | 
24 graduate business} 3.56 3.84 3.96 
students (night) 
89 middle managers 4.02 4.07 3.93 
Political items 
83 graduate business 3.30 3.26 3.32 
students (day) 
24 graduate business 3.17 3.60 3.73 
students (night) 
89 middle managers 3.61 3.69 3.59 


lected from 89 graduate business students, 
most of whom had had no industrial experi- 
ence; these data were contrasted with a sam- 
ple of 24 night students who had had an aver- 
age of approximately 5 years of experience 
and a sample of 89 upper-middle managers 
of large firms who had had about 20 years of 
experience, on the average. y 
Table 4, Column 1, contrasts the mean dif- 
ferences found between the three samples on 
social approach items, then again on political 
approach items. All social means were signifi- 


tions, compensation, life goals, supervision, group 
dynamics, planning, communication, negotiations, an 
evaluation. Lectures and T-groups were interwoven 
in the program. Typically, discussions following 
each exercise noted that there was no one best way 
to manage and that a manager had to balance many 
objectives, be flexible in relating to subordinates, 
cope with the conflict between individual participa- 
tion and organizational demands, etc. Neither ; 
complete social nor a complete political point © 
view was advocated. For more detail, see Bass (1967). 
The students completed the approximately 40-hour 
program in 15 weeks; the managers, in 1 week. 


cantly higher than the corresponding 
means for the same samples. 


political 
Table 5, Column 1, contains the two ap- 
of these 


! 
| 
iy 


j 
ri 
; 
f 
i 


b and Argyris thas with Martin, Sias, Jamesa, 
propriate analyses of the variance and Påfner, Yet, they subvcr®wd almost as 
means. In both cases, the obtained much to the latter men's polst of view as 


i 


results 
could not be accounted for by chance. Middle 
managers felt that to be successful in large 
organizations required greater frequencies of 
the social as well as the political 
described by the items. The mean 
for managers was 4,02, while it 
3.56, respectively, for the student 
What managers saw as needed very 
perceived to be needed 
very often by the students. Somewhat 
frequencies of political activity were 
ceived as needed, but again, managers (3 
felt more of this activity was required for 
success than did the graduate students (3.30 
and 3.17). The correlation of social and politi- to 288. 
cal approach means among the three groups A were 
probably reflected to some extent the correla- ing, is, acting confident even when unsure 
tion mentioned earlier among the individuals about matters, Probably this item 
within the groups. 
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three 
of social and political approaches for success. r 
All three groups favored somewhat greater 
use of the social approaches compared to the the three samples for the various social ap- 
political (3.62 versus 3.30, 3.56 versus 3.30, proach items occurred in 


TABLE 5 


ARIANCE OF THE SOCIAL AND Poumicar Soomes ow OSQ 
e — 2 Stipan ih 


Between samples 
Error 
Total 


Between samples 
Error 
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TABLE 6 
SPECIFIC DIFFERENCES BETWEEN GRADUATE BUSINESS STUDENTS AND MIDDLE 
MANAGERS BEFORE AND AFTER TRAINING 
— = = — = — 
89 graduate 24 graduate À 
Approved aer business business stu- Nae All (N = 196) 
8 students (day) | dents (night) 8 
Social 
Level, be candid Before 3.69 3.46 4.20 3.88 
After 3.72 3.85 4.04 3.86 
Share decisions Before 3.19 3.50 4.17 3.64 
After 3.76 4.02 4.23 4.08 
Commit oneself fully Before 3.20 3.13 3.59 3.36 
After 2.98 3.38 3.83 3.38 
Set mutual objectives Before 4.28 3.96 4.35 4.27 
After 4.36 4.29 4.25 4.29 
Foster mutual trust Before 4.58 4.58 4.75 4.65 
After 4.64 4.73 4.86 4.73 
Hold group discussions Before 2.76 2.75 3.06 2.89 
After 3.45 3.28 3:22 3.28 
Political 
Withhold information Before 2.85 2.46 3.25 2.87 
After 3.13 3.63 3.61 3.40 
Bluff Before 1.98 2.21 2.33 2.15 
After 2.06 2.38 2.25 2.18 
Make alliances Before 3.42 3.04 4.01 3.62 
After 3.26 3.38 4.13 3.64 
Retard and delay Before 4.27 4.17 4.37 4.30 
After 3.99 4.13 4.28 4.13 
Maintain distance Before 3.07 3.00 3.31 3.16 
After 3.12 3.83 3.22 3.25 
Compromise but divert Before 4.21 4.17 4.39 4.28 
After 3.97 4.29 4.61 4.29 


Note.—5 = always; 4 = very often; 3 = fairly often; 2 = sometimes; 1 = seldom; 0 = never. 


the need to share decision making with sub- 
ordinates whenever possible. The experienced 
managers saw this as needing to be done very 
often (4.17), the night students with some 
experience felt less strongly about this (3.50), 
while the experienced day students called for 
this action only fairly often (3.17). Again, 
managers were most in favor of leveling, 
openness, candor, and full commitment com- 
pared to the students. 

On political approaches the managers and 
students were widest apart on the utility of 


forming political alliances to suit personal 
ambitions. The managers saw this as needed 
very often (4.01), the students less so (3.04 
and 3.42). The managers also saw the need 
to withhold information more frequently 
(3.25) than did the students (2.46 and 7,85): 


EFFECTS OF TRAINING IN ORGANIZATIONAL 
PsycHoLocyY 
Table 4, Column 2, shows the actual mean 


scores of the three samples following training, 
and Table 4, Column 3, shows the same 


How to Succren m Business 


mean scores after the effects have been sub- 
tracted from them of the differences among 
the participants prior to training. Table 5 dis- 
plays the appropriate covariance analysis of 
these adjusted posttraining means, From these 
results we infer that the graduate students, 
who initially had been significantly (F = 
7.28) lower than the managers in calling for 
social approaches, increased in this point of 
view during training while the managers re- 
main unchanged. At the conclusion of train- 
ing students had become like the managers 
(F = .20) on this dimension. With reference 
to political approaches, both day students as 
well as the managers remained unchanged in 
outlook, but as a consequence of training 
more experienced night students increased in 
their belief in the efficacy of political activity 
so much so that their actual final scores on 
this dimension matched those of the managers 
and their adjusted mean surpassed that of the 
managers. 


Specific Differences (after Training) 


Table 6 shows the responses to each of the 
12 OSQ items after as well as before the stu- 
dents and managers completed the course 
which included the Program of Exercises in 
Management and Organizational Psychology. 

Two multivariate analyses of covariance 
were completed on these data. In the first, 
the final six social approach responses were 
adjusted according to their multiple correla- 
tions with the initial social and political re- 
sponses of the trainees. In the second analysis, 
the final six political approach responses of 
the trainees were adjusted according to their 
multiple correlations with the initial social and 
political responses of the trainees. In both 
analyses there remained at the .01 level of 
confidence significant sources of variance due 
to whether the trainees were day or night 
graduate students or middle managers. The 
respective Fs were 3.85 and 2.54 with 12 and 
352 degrees of freedom for each analysis.* ; 

Social analysis, The most interesting shifts 
in stress on social approaches occurred with 
reference to sharing decisions and holding 


For a discussion of the multivariate analysis of 
covariance see W. W. Cooley and P. R. Lohnes, 
Multivariate procedures for the behavioral sciences. 
New York: Wiley, 1962. 
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group discussions. It can be seen that although 
training raised all samples on how much they 
thought sharing decisions with subordinates 
was necessary, the students were affected 
much more than the managers, The student 
means rose toward those of what the managers 
had been. On the other hand, while there was 
a sharp increase by students in emphasis on 
holding group discussions, the comparable in- 
crease for managers was so much smaller that, 
after training, students stressed this more 
than did managers, a reversal of their respec- 
tive positions before training. 

Political analysis. All three samples in- 
creased significantly in their valuing the util- 
ity of withholding information or timing its 
release as a consequence of the training ef- 
fort. Bluffing remained as unpopular as be- 
fore training. While inexperienced students de- 
creased their support for forming alliances, 
experienced students and managers increased 
in their favoring this approach. There was a 
slight overall decline in all samples in valuing 
delaying tactics. Among the night graduate 
students, there was a sharp increase (3,00- 
3.83) in the opinion that maintaining social 
distance and aloofness was important for suc- 
cess in the large organization; the other two 
samples were unchanged. Finally, the middle 
managers seemed to have increased in their 
support of open compromise but secret diver- 
sion of plans. 


CONCLUSIONS 


A meaningful consistent set of responses 
emerge when students and managers are asked 
to indicate what they think it takes to succeed 
in large organizations. On the whole, they see 
that the fellow who gets ahead is one who 
adopts fairly or very often both social and 
political approaches, the former somewhat 
more than the latter. He is one who will try 
to foster trust, share in the decisions with 
others, and consider the organization’s needs. 
At the same time, to a somewhat lesser extent 
he is ready when necessary, to maintain his 
distance and his prerogatives, to act confident, 
even if he is not sure of himself, and to con- 
sider his own personal advantage. 

Orientation and values, per se, only slightly 
relate to these responses, that is, self-oriented 
respondents favor more political activity, and 
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the attitudes reflected in these responses tend 
to account for whether socially acceptable 
rather than economically determined budget- 
ing decisions will be made. 

Middle managers tend to see more of both 
the social and political approaches required 
of the successful man in the large organization 
than do graduate business students, but gen- 
erally the students become more like the man- 
agers as a consequence of a training program 
in organizational psychology while the man- 
agers seem less affected by the program. 
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JOB SATISFACTION AMONG GRADUATE STUDENTS: 
INTRINSIC VERSUS EXTRINSIC VARIABLES 
EDWARD L LEVINE axo JOSEPH WEITZ! 


New York University 


A survey of “job” satisfaction was conducted among graduate students in 2 
universities, A major source of dissatisfaction at both universities was found 
to be student voice in influencing department policy. When the responses were 
factor analyzed, Ist- and 2nd-order factors did not indicate an intrinsic or ex- 
trinsic dimension, nor did the findings support a 2-factor theory of “job satis- 
faction.” Furthermore, the alternative to the 2-factor theory, namely that in- 
trinsic variables are more importantly related to measures of overall satisfaction, 
was similarly not supported. The conclusion is that both the 2-factor theory 
and its alternative are oversimplifications, Finally, the survey results of general 


interest are presented. 


As in so many recent studies, much of our 
analysis takes departure from the two-factor 
theory of job satisfaction outlined by Herz- 
berg, Mausner, and Snyderman (1959). The 
results of tests of the theory, which was far 
from exact to begin with (Ewen, 1964), have 
been mixed, for and against. Burke (1966) 
offers an objective and fairly thorough sum- 
mary of much of the relevant literature, while 
Herzberg (1966), in his more recent book, 
offers studies supporting the theory and criti- 
effectively, the r 
methodology employed by the theory’s P- 
ponents. 

One of the main objections to the two- 
factor theory is that it is method bound. Most 
studies which support the theory utilize the 
same open-ended “‘critical incidents” approach 
employed in the original study (cf. Herzberg, 
1966, and the review by Hulin, 1966b). On 
the other hand, evidence against the two- 
factor theory has come from several sources— 
studies of dimensionality, including Q analy- 
ses (Dunnette, Campbell, & Hakel, 1967), 
factor analysis (Friedlander, 1963; and, as a 


The authors wish to thank Byron Fiman, Jeffrey 


Greenhaus, Robert Schoen, Theodore Venet, and 


Virginia Schein who helped construct the question- 
naire and collect the data. 


more direct test, Graen, 1966b), rotated prin- 
cipal components (Ewen, 1964), Coombs’ un- 
folding technique (Burke, 1966), and studies 
utilizing the overall satisfaction measures 
neglected by Herzberg et al. (1959) in their 
original study. Among the latter, analysis of 
variance (Friedlander, 1966; Graen, 1966a), 
multiple regression (Halpern, 1966), and a 
series of hypothesis tests by Ewen, Smith, 
Hulin, and Locke (1966) have been employed 
in evaluation of the two-factor theory. 

By way of results, the dimensionality 
studies have not succeeded in establishing any 
clear-cut intrinsic-extrinsic dichotomy. Fur- 
thermore, the other group of studies, along 
with evidence presented by Wernimont 
(1966), Dunnette et al. (1967), Hinrichs and 
Mischkind (1967), and Mischkind (1967), 
have revealed the weaknesses of the two-factor 
theory and have pointed to an alternative 
which postulates that intrinsic (to the job) 
variables are more important in producing 
overall satisfaction, as well as overall dis- 
satisfaction. 

Our study, a survey of job satisfaction 
among graduate students majoring in psy- 
chology, proposes to extend the work of these 
investigators. We intend to do a first- as well 
as a second-order factor analysis, as an addi- 
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tion to Ewen’s (1964), Friedlander’s (1963), 
and Graen’s (1966b) work, in the supposition 
that an intrinsic-extrinsic dichotomy may 
emerge in second-order factors. 

Moreover, Halpern’s (1966) test of the 
two-factor theory, in which he found that 
motivators account for most of the variance 
in overall satisfaction with a satisfied sample, 
has been vitiated to an extent by the emer- 
gence of the above-mentioned alternative the- 
ory. His results support the two-factor theory, 
as well as an alternative theory. The two- 
factor theory predicts that for people on the 
satisfied end of the continuum, “motivators” 
should be more strongly related to overall 
satisfaction than “hygienes.” On the other 
hand, for people on the dissatisfied end of the 
continuum, “hygienes” should explain more 
variance in overall satisfaction. 

The alternative theory, which might be 
called the two-factor-plus-importance theory, 
predicts that intrinsic variables should relate 
more strongly to overall satisfaction than 
extrinsic variables regardless of the level of 
overall satisfaction; that is, they are more 
important. 

Our results will bear on the two-factor-plus- 
importance prediction, since, in an extension 
of Halpern’s (1966) test to determine whether 
intrinsic variables explain more variance in 
overall satisfaction, we do not restrict our 
sample only to respondents who are satisfied 
overall. 

Survey data are rich in analytic possi- 
bilities, and we would be remiss in reporting 
only the results bearing on the controversial 
issues just considered. One interesting area we 
plan to explore is that of the moderator 
variable. 

Recently, moderator variables have come 
into prominence as a way of improving pre- 
dictions (Ghiselli, 1963; Weitz, 1966), and 
we will employ traits assumed to covary with 
sex for this purpose. 

Hypothesizing that females are more so- 
cially oriented than males, and males more 
independent,“ the predictions are: for fe- 
males, satisfaction with social life, satisfac- 
tion with the opportunity for instructor-stu- 
dent discussion, and satisfaction with their 
social interaction with fellow students will be 
more highly correlated with overall satisfac- 
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tion than for males. For males, satisfaction 
with the voice they have in influencing de- 
partmental policy, satisfaction with their in- 
come relative to other graduate students, and 
satisfaction with the opportunity for inde- 
pendent thought and action will be more 
highly related to overall satisfaction, than for 
females. 

Finally “raw” survey results considered of 
interest to the scientific community will be 
presented. Graduate students, of course, will 
become the psychological community of the 
future, and their present attitudes toward 
their training, their projected careers, their 
mentors, may play a large role in determin- 
ing the characteristics of that community. To 
the extent that our results are generalizable, 
some insights may be obtained from the 
present data. 


PROCEDURE 
The Questionnaire 


The Graduate Student Questionnaire was designed 
in the following manner: 


1. Of the 78 items on the questionnaire, 35 covered 
a number of specific areas of possible satisfaction or 
dissatisfaction along with overall satisfaction. For 
the majority of the items, a 7-point scale was used, 
ranging from very satisfied to very dissatisfied. Of 
the 35 items, 5 were overall measures, for example, 
“How satisfied are you with your overall graduate 
training?”; 7 items were rated intrinsic to the stu- 
dents’ work, for example, How satisfied are you 
with the intellectual stimulation provided by your 
graduate training?”; 19 items were rated extrinsic to 
the students’ work, for example, “How satisfied are 
you with your social life?” The remaining 4 items 
could not be classified into any of the above cate- 
gories, for example, “How satisfied are you with 
your decision to become a psychologist?” 

The classification of items into intrinsic, or mo- 
tivator, variables and extrinsic, or hygiene, variables 
followed the scheme presented by Herzberg et al. 
(1959). A consensus of opinion among the authors 
and our colleagues (see Footnote 1) was required for 
the assignment of items to each of the categories 
mentioned. Where consensus could not be reached, 
or where the item did not lend itself to any meaning- 
ful classification, no assignment was made. j 

2. Five items asked what the respondent’s primary 
career goals were. The item format specified that the 
respondent should divide the 100% of his work time 
among the categories “teaching,” “research in an 
academic setting,” “research in a non-academic set- 
ting,” “professional practice,” “administration,” an- 
the ubiquitous “other.” 

3. The respondents were also asked to suggest the 
foundation courses which all students in psychology 
should take. 
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4. A number of other items covered a variety of 
topics such as respondent's satisfaction with his 
grades, to how many other schools he applied and 
was accepted, whether his present university was his 
first choice, and whether, if given another chance, 
he would still choose his present 8 

5. Another block of items listed certain profes- 
sional skills a psychologist might need in his work, 
for example, experimental design, statistical analysis, 
etc. The respondent was asked whether this skill was 
important or not important to him, and whether he 
was satisfied or dissatisfied with his training in that 
skill. 

6. There were two open-ended questions concern- 
ing what the student liked best and least about his 
graduate training (“Bouquets” and “Brickbats’’). 

7. The last group of items concerned the follow- 
ing demographic characteristics: major area of 
study, sex, age, marital status, student status, years 
pursuing graduate study, form of university financial 
assistance, if any, distance from place of residence 
to the graduate school, which university the re- 
spondent was attending, and whether or not he was 
a matriculant only (taking no courses), or a full-time 
student. 


This questionnaire, along with a cover letter which 
explained the purposes of the questionnaire and gave 
its sponsorship, was mailed to all psychology gradu- 
ate students at two universities, Anonymity was 
guaranteed. 


Sample 


The sample consisted of graduate students enrolled 
in the psychology department at two. universities, 
one rather large graduate department, and one rela- 
tively small. 

While the total response rate for the larger uni- 
versity was 53%, the primary results to be pre- 
sented here will be on 93 cases, all of whom are 
full-time students taking courses. This represents 70% 
of the respondents. 

For the small university a 61% return was ob- 
tained. 


RESULTS ° 


Tot. refers to the total group in University 
A taking courses, V = 93. ; 

MO refers to the group maintaining matric- 
ulation only, W = 40, at University A. 

B refers to the second University, N = 19. 


Factor Analysis 


Responses of Tot. on the 35 items concern- 
ing specifics of graduate training and overall 
satisfaction were subjected to a factor analy- 
_ 

2 The authors gratefully acknowledge the . 
tion of John D. Hundleby to the analyses reported. 
Tom Keenan, Chuan-Yu Chen, and Thanton Porter 
helped prepare the data for computer processing. 


sis. The method of principal axes was used 
with an Equamax rotation.“ The orthogonal 
factors which emerged were then rotated so 
that they became oblique by a Promax rota- 
tion (Hendrickson & White, 1964), and sec- 
ond-order factors were extracted by Varimax 
(Kaiser, 1958), Squared multiple correlations 
were placed in the diagonals, and extraction 
was halted when the number of loadings of 
items on a factor above .30 was less than two. 
That the number of factors extracted in the 
first order is the correct number is given 
credence by the finding that the number ex- 
tracted would have been the same if another 
criterion independent of this one had been 
used, that is, the number of eigenvalues of the 
correlation matrix, with unities in the di- 
agonals, greater than one. As many second- 
order factors were extracted and rotated suc- 
cessively as first-order factors. An upper- 
bound estimate of the reliability of these 
items, as given by the Spearman-Brown for- 
mula, using the average item intercorrelation 
(Guilford, 1954), was .91. To the extent that 
the assumptions of the formula are not met, 
the coefficient is overstated. However, Guil- 
ford (1954, p. 378) points out that a coef- 
ficient above .60 will generally not be more 
than .05 in error, and reliabilities of this mag- 
nitude are not extraordinary for items of this 
type (Edwards, 1957). 

Missing data were handled by ignoring 
those pairs of observations where one or both 
observations in a pair were not included. The 
items containing missing data were primarily 
concerned with assistantship jobs and could 
not, therefore, be expected to apply to every- 
one. However, none of the Ns was lower 
than 70. 

The factors themselves emerged as follows: 
(Only loadings above .30 are considered in 
view of the small V. Furthermore, the items 
loading on each factor will be listed by their 
questionnaire number, and following each 
item in parenthesis will be the designations I 
for intrinsic, E for extrinsic, O for overall, 
and X for unclassifiable measures, as well as 
the factor loadings.) 

Factor 1. General Satisfaction—measures 
satisfaction with: 


3 David Saunders has devised this criterion. 
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65. Graduate student life, everything con- 
sidered (O, .74) ; 
1. Overall graduate training (O, .71); 
40. Preparedness to achieve goals (O, .64); 
6. Decision to become a psychologist (X, 
59); 
33. The degree to which the department 
lived up to expectations (O, 8705 
10. Quality of instruction (E, . 56); 
32. The training in major area of interest 
(0, .52); 
3. Anticipated future as a psychologist (I, 
50); 
7. Intellectual stimulation from graduate 
training (I, 48); 
. The intellectual climate in the depart- 
ment (E, .41). 


Factor 2. A complicated factor which loaded 
in order of magnitude on satisfaction with: 


E 


25. Academic performance of fellow stu- 
dents (E,. 79); 
26. Intelligence of fellow students (E,. 74); 
9. Prestige of department (E, .59) ; 
8. Intellectual climate in the department 
(E, 51); 
23. Work and study interaction with fellow 
students (E, .45); 
7. Intellectual stimulation from training 
(J,. 45); 
33. The degree to which the department 
lived up to expectations (O, 44); 
10. Quality of instruction (E, 42); 
32. The training in major area of interest 
(O, 33); 
30. The amount of required work in your 
courses (E, 32). 


This factor has been named Satisfaction 
with Intellectual Stimulation. However, it also 
seems to measure the quality of peers, the de- 
partment, and the instruction, along with in- 
tellectual stimulation. 


Factor 3. Satisfaction with Assistantship 
Job measures satisfaction with: 


4. Social relationship with your assistant- 
ship supervisor (E, .83) ; 

27. Recognition for work as an assistant 
(I, 81); 

28. Feedback on performance as an as- 
sistant (I, .64); 
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15. Income relative to other graduate stu- 
dents (X, .59); 

5. Competence of assistantship super- 
visor (E, .41); 

17. Research space available to you (E, 
34). 
Factor 4. Satisfaction with Physical En- 
vironment and Setting—measures satisfaction 
with: 
16. Study space available (E, .72) ; 
18. The overall physical plant at your uni- 
versity (E, .71); 

19. Recreational and cultural facilities pro- 
vided by your university (E,. 58); 

14. The library facilities (E, . 58); 

17. Research space available to you (E, 
56). 


One item, feedback on your performance as 
an assistant, loaded .40 on this factor; how- 
ever, the other item loadings were above .55, 
so that in this case we thought it justifiable to 
exclude this item. 


Factor 5. Satisfaction with Constraints 
Placed upon the Student—measures satisfac- 
tion with: 

29. Emphasis placed on grades (E, . 67); 

22. Freedom in choosing course work (1, 

5105 

12. Progress toward the PhD (I, 48); 

9. Prestige of the department (E, 45); 

21. Opportunity for independent thought 

and action (I,. 41); y 
13. Voice in setting departmental policy 
(X, 40); 

31. Time limits placed on the student (E, 
39); 

30. Amount of required work in courses (E, 
36); 

1. Overall graduate training (O, 35); 

17. Research space available (E, 32); 

11. Opportunity for instructor-student dis- 
cussion (X, .30). 

The only item that causes difficulty is Item 
9, prestige of the department. Perhaps we can 
resolve the presence of this item on the basis 
that if department prestige is higher the st 
dent will have greater latitude in choosing a 
position, in publishing research, in making 
contacts, In sum, a department’s prestige can 
“open doors” for the student. 
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Factor 6. A complicated factor, loading on 
satisfaction with: 


24. Social interaction with fellow students 
(E, .70); 
23. Work and study interaction with fellow 
students (E, .62); 
2. Social life (E, .57); 
3. Future as a psychologist (I, 48); 
6. Decision to become a psychologist (X, 
47); 
12. Progress toward the PhD (I, 43). 


For want of a better label this factor will be 
called Satisfaction with Social Future. 


Factor 7. Satisfaction with Intellectual 
Stimulation and Freedom to Pursue Intellec- 
tual Interests—measures satisfaction with: 


5. The competence of your assistantship 
supervisor (E, .59); 
21. The opportunity for 
thought and action (I, .54); 
7. The intellectual stimulation provided 
by graduate training (I, .46); 
8. The intellectual climate in the depart- 
ment (E, 44); 
19. The recreation and cultural facilities 
provided by your university (E, 42); 
32. The training in major area of interest 
(O, (42); 
31. The time limits placed on the student 
(E, 37); 
30. The amount of required work in your 
courses (E, 30); 
22. The amount of freedom in choosing 
course work (I, 30). 


The three second-order factors, which ac- 
count for virtually all the variance in the 
first-order factors’ correlation matrix, are in- 
teresting in how they group those of the first 
order (see Table 1). 

Second-Order Factor 1. This factor groups 
general satisfaction (Factor 1), intellectual 
stimulation (Factor 2), and constraints 
placed upon the student (Factor 5). In terms 
of intrinsic and extrinsic variables, all the in- 
trinsic items (3, 7, 12, 21, 22) are represented 
here except the two concerning feedback as 
an assistant and the recognition received as 
an assistant, while only 10 of 19 extrinsic 
items appear (8, 9, 10, 17, 23, 25, 26, 29, 
30, 31), 


? 


independent 
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TABLE | 
Srcoxp-Ornza Factors 
Factor | 1 2 5 
1 562 254 — 003 
2 455 886 ? 
3 3 488 130 
4 146 239 | 50 
5 1 570 156 056 
6 203 373 ~ 241 
7 081 | 356 084 


Second-Order Factor 2. This factor groups 
satisfaction with assistantship job (Factor 3), 
satisfaction with social future (Factor 6), 
satisfaction with intellectual stimulation and 
freedom to pursue intellectual interests (Fac- 
tor 7). In terms of extrinsic and intrinsic 
factors, all of the intrinsic items appear (3, 7, 
12, 21, 22, 27, 28), while only 10 of 19 ex- 
trinsic items appear (2, 4, 5, 8, 17, 19, 23, 
24, 30, 31). 

Second-Order Factor 3. This factor isolates, 
to a large extent, satisfaction with physical 
environment and setting, although satisfaction 
with intellectual stimulation is represented 
and, to a small degree, satisfaction with so- 
cial future. Only two of the seven intrinsic 
items appear (considering only First-Order 
Factors 2 and 4 which load .30 or above). 
Twelve of 19 extrinsic items are represented 
(1, 9, 10, 14, 16, 17, 18, 19, 23, 25, 26, 30). 

Some variance of general satisfaction is ac- 
counted for by Second-Order Factors 1 and 2, 
where all of the intrinsic items appear, but 
none at all in Second-Order Factor 3 where 
only 2 of 7 intrinsic and 12 of 19 extrinsic 
items appear. 

Finally, that First-Order Factors 2 and 6, 
the social factors, were complicated (cf. Hulin, 
1966a) is made plainer by the second-order 
analysis. Factor 2, satisfaction with intellec- 
tual stimulation, has some variance repre- 
sented on Second-Order Factors 1 and 4. Fac- 
tor 6, satisfaction with social future, has its 
variance fairly evenly divided among Second- 
Order Factors 1, 2, and 4. One can only con- 
jecture that the “social” aspects of these di- 
mensions are represented on the more “ex- 
trinsic”? Second-Order Factor 4, while the more 
job-related, self-actualizing, intellectual por- 
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TABLE 2 
INTERCORRELATIONS OF OVERALL SATISFACTION MEASURES 


Question 1 


32 65 | 40 | 33 


erg MM ŘE 
How satisñed are you with: 
1. overall graduate training? 
32. training in major area? 
65. graduate student life, overall? | 
40. Will you be prepared to reach your goals? 
33. Did department meet expectations? | 


tions are represented in Second-Order Factors 
1 and 2, the more “intrinsic” second-order 
factors. 

Interestingly, Item 20, satisfaction with 
recreational and cultural facilities in the city 
in which your school is located, does not have 
a salient loading on any factor, the only vari- 
able for which this is true. 


Multiple-Regression Analysis 


A Wherry stepwise multiple-regression anal- 
ysis (Wherry, 1940) was employed to de- 
termine the relationships of intrinsic and ex- 
trinsic items to overall satisfaction. The five 
overall measures, whose intercorrelations are 
presented in Table 2, were employed as de- 
pendent variables. 

The analysis was done in three stages. 
First, all seven intrinsic variables were cor- 
related with overall satisfaction, then the 19 
extrinsic variables, and finally all 26 extrinsic 
and intrinsic variables together. The multiple- 
regression coefficients (see Table 3) are ob- 
viously in contrast to what the two-factor-plus- 


TABLE 3 


COMPARISON OF THE RELATIONSHIPS OF INTRINSIC AND 
EXTRINSIC VARIABLES TO OVERALL SATISFACTION 
MULTIPLE-REGRESSION COEFFICIENTS 


Criterion Intrinsic Extrinsic 
variable* variables variables 
1 68 76 (.71) 

32 62 73 (.69) 
65 2 79 (76) 
40 57 50 (47) 
33 67 85 (.83) 


_Note.—The coefficients in parentheses are the correlations 
yielded by considering only the best seven extrinsic variables, 
a See Table 2 for item stems. 


a — 4 
1,00 49 66 50 57 
1,00 48 44 62 
1.00 61 60 
1.00 45 


importance hypothesis predicts, since the 
extrinsic variables tend to predict overall satis- 
faction better in four out of five cases. The 
differences in coefficients do not seem espe- 
cially great, but the differences magnify some- 
what when one considers the variance ac- 
counted for. 

A note of caution is in order, for one might 
well question this evidence on the basis that 
we “stacked the deck” against the intrinsic 
variables, since we reported the results for all, 
and the best seven, variables from the ex- 
trinsic set. However, such an objection cannot 
be sustained when the regression equations of 
both intrinsic and extrinsic variables on over- 
all satisfaction are computed. The sum of the 
contributions of each variable to the variance 
of the overall measures will then give a truer 
indication of the relative importance of each 
set of variables, We will arbitrarily set the 
number of variables included in this analysis 
to 10, since most of the systematic variance 
will be accounted for, and the possible effect 
of the larger number of extrinsic variables 
will be offset. 

The results, as shown in Table 4, are still 
contrary to the two-factor-plus-importance 
theory, for in three of five cases extrinsic 
variables account for more variance. Such à 
result also implies that the use of a one-item 
measure of overall satisfaction in tests of a 
theory is a far from adequate strategy. 

Furthermore, while these results were un- 
expected, they were not unprecedented. Hin- 
richs and Mischkind (1967) report thats 
contrary to the two-factor and the two-factor- 
plus-importance theory, “hygienes” were re- 
ported more often than “motivators” as Tea 
sons for both satisfaction and dissatisfaction, 
by both high- and low-satisfaction people. 
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Survey Results 


1. The predictions concerning the moderat- 
ing effects of sex are presented in Table 5. 
Two of the predictions hold up: for males, 
satisfaction with independent thought and ac- 
tion (p< .05) and voice in departmental 
policy (p < .07) are more tly related 
to overall satisfaction than for females. How- 
ever, the reverse of the prediction that satis- 
faction with instructor-student discussion 
would yield a higher relationship with overall 
satisfaction for females was borne out ( < 
01). 

2. “Raw” survey results for variables which 
have been judged important by content, and 
by their importance to overall satisfaction, 
reveal the following for B, MO, and Tot.: 


All three groups answer all overall satis- 
faction items “slightly satisfied,” with B some- 
what more satisfied on the average. 

All respondents were slightly satisfied to 
satisfied, on the average, with their anticipated 
future as a psychologist, their decision to be- 
come a psychologist, the intellectual stimula- 
tion provided by their training, the intellec- 
tual climate in their department, their prog- 
ress toward the PhD, the opportunity for 
independent thought and action, and the 
amount of required work in their courses. 
However, the major, and indeed the only out- 
standing, source of dissatisfaction for all 
groups was with their voice in influencing de- 
partment policy, a result consistent with the 
recent nationwide student unrest. 


3. Statistics, psychometrics, and research 
design, along with history and systems, were 
endorsed most often by all groups as desirable 
basic courses for all students in psychology 
to take. 

4. According to the open-ended question 
requesting “Bouquets” and “Brickbats,” the 
faculty, its general quality, supervisory and 
technical competence, and breadth of outlook, 
was the most important source of satisfaction 
as well as dissatisfaction in both universities. 


Discussion 


Within the limits set by the sample size, 
our questionnaire seems to be a fairly reliable 
instrument, and quite interesting factorially. 
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It is gratifying that, for the most part, the 
factors are clear cut; only in Factor 2 and 
Factor 6 are the interpretations strained, 


Nemar's (1951) caveat concerning overinter- 
pretation perhaps does not apply here. Fur- 
thermore, the factors correspond quite cle 
to the coding categories set up for the “Brick- 
bat“ and “Bouquet” items; freedom, assistant- 
ship job, physical environment, level of over- 
all training, peers, are all clearly represented. 
Our results, as they bear on the con- 
troversy surrounding the two-factor theory 
and its alternative, add more evidence that 
the intrinsic-extrinsic dichotomy is not em- 
pirically useful, Our Second-Order Factors 1 
and 2 can be at best characterized as more 
intrinsic than extrinsic, and vice versa for 
Second-Order Factor 3. The complexity of 


TABLE $ 


COMPARISON of THE RELATIONSHIPS BETWEEN VArt- 
anes 2, 11, 13, 15, 17, 21, 24 axo Ovrasir 
Sar CG NAA! 


ä— — — — 


270 


this intrinsic-extrinsic dichotomy is further 
shown by the fact that First-Order Factor 2, 
a predominantly extrinsic factor, loads most 
highly with the Second-Order Factor 1, where 
the general satisfaction factor appears. It 
seems that the context within which a variable 
is embedded must be taken into account in 
terms of its relevance for the satisfaction of 
self-actualization, or of more basic needs. For 
example, quality of instruction, for a graduate 
student, bears heavily on intellectual stimula- 
tion and overall satisfaction, while it may 
mean little to a sales trainee taking a 1-week 
course. Advancement is quite the “motivator” 
for an upwardly mobile business executive, 
but for certain civil servants, whose promotion 
may depend more upon time, it may have 
little, if any, motivational implications. 

Furthermore, in terms of effects on overall 
satisfaction, the hypothesis that intrinsic fac- 
tors are more important, which must make 
the damaging assumption of an intrinsic-ex- 
trinsic dichotomy, does not hold up. Does it 
not seem strange to expect that, when more 
sophisticated means of measuring importance 
have failed (Ewen, 1965), merely classifying 
variables as intrinsic or extrinsic should allow 
one to achieve meaningful results? 

Nature is frequently unparsimonious. Al- 
though a simple explanation is desirable, it 
must not be so simple as to mask, rather than 
mirror, the explicandum (Chein, 1963; cf. 
also Dunnette et al., 1967). What is needed 
in the area of job satisfaction is a broadsided 
attack on problems of definition and theory 
where expectations (Weitz, 1956; Wernimont, 
1966), community characteristics (Hulin, 
1966a), and other situational parameters, in- 
cluding employee needs (Katzell, Barrett, & 
Parker, 1961), are taken into account. (For 
the most sophisticated attempts along these 
lines see Hinrichs and Mischkind, 1967, and 
Mischkind, 1967, where a model of job satis- 
faction based on social learning theory is at- 
tempted.) Indeed, is it reasonable to ap- 
proach the study of job satisfaction in a more 
simple-minded fashion than the study of 
modern personnel selection and placement 
(Dunnette, 1963), or to assume that the 
dynamics at work should be any less complex? 


Epwarp L. Levine AND JoserHu WEITZ 


REFERENCES 


Burke, R. J. Are Herzberg’ motivators and hy- 
gienes unidimensional? Journal of Applied Psy- 
chology, 1966, 50, 317-321. 

Cuer, I. The image of man. Journal of Social Jy. 
sues, 1963, 19, 1-35. 

Dunnetre, M. D. A modified model for test valida. 
tion and selection research. Journal of Applied 
Psychology, 1963, 47, 317-323. 

Duxxrrrr, M. D., Campsett, J. P., & Haxer, M. D 
Factors contributing to job satisfaction and job 
dissatisfaction in six occupational groups. Organisa- 
tional Behavior and Human Performance, 1967, 
2, 143-174. 

Epwarps, A. L. Techniques of attitude scale con- 
struction, New York: Holt, Rinehart & Winston, 
1957. 

Ewen, R. B. Some determinants of job satisfaction: 
A study of the generality of Herzberg's theory. 
Journal of Applied Psychology, 1964, 48, 161-163. 

Ewen, R. B. Weighting components of job satisfac- 
tion. Unpublished doctoral dissertation, University 
of Illinois, 1965. 

Ewen, R. B., Smiru, P. C., Hutt, C. L., & Locke, 
E. A. An empirical test of the Herzberg two-fac- 
tor theory. Journal of Applied Psychology, 1966, 
50, 544-550. 

FRIEDLANDER, F. Underlying sources of job satisfac- 
tion. Journal of Applied Psychology, 1963, 47, 246- 
250. 

Frrepianper, F. Motivations to work and organiza- 
tional performance. Journal of Applied Psychology, 
1966, 50, 143-152. 

Gutsertt, E. A. Moderating effects and differential 
reliability and validity. Journal of Applied Psy- 
chology, 1963, 47, 81-86. 

Grarn, G. B. Addendum to “An empirical test of 
the Herzberg two-factor theory.” Journal of Ap. 
plied Psychology, 1966, 50, 551-555. (a) 

Graer, G. B. Motivator and hygiene dimensions for 
research and development engineers. Journal of Ap- 
plied Psychology, 1966, 50, 563-566. (b) 

Guitrorp, J. P. Psychometric methods. New York: 
McGraw-Hill, 1954. 

HaLPERN, G. Relative contributions of motivator 
and hygiene factors to overall job satisfaction. 
Journal of Applied Psychology, 1966, 50, 198-200. 

Hennrtcxson, A. E., & Warre, P. O. Promax: A 
quick method for rotation to oblique simple struc- 
ture. British Journal of Mathematical and Sta- 
tistical Psychology, 1964, 17, 65-10. T 

Herzserc, F. Work and the nature of man. New 
York: World Press, 1966. 

Herzeerc, F., Mausner, B., & SNYDERMAN, B. B. 
The motivation to work. New York: Wiley, 1959. 

Hivricus, J. R., & Miscuxinn, L. A. Empirical an¢ 
theoretical limitations of the two-factor hypothesis 
of job satisfaction. Journal of Applied Psychology, 
1967, 51, 191-200. 5 

Hurm, C. L. Effects of community characteristics 0” 
measures of job satisfaction. Journal of Applie 
Psychology, 1966, 50, 185-192. (a) 


P 


1961, 45, 65-72. 

McNemar, Q. The factors in factoring. Psyche- 
metrika, 1951, 15, 353-359. 

Miscuxixp, L, A. A two-factor theory of job satis- 
faction; A multi-method and situational analysis. 


Journal of Applied Psychology 
1968, Vol. 52, No. 4, 272-279 


CRITERIA USED BY RESEARCH AND DEVELOPMENT 
ENGINEERS IN THE SELECTION OF AN 
INFORMATION SOURCE* 


PETER G. GERSTBERGER anv THOMAS J. ALLEN 


Massachusetts Institute of Technology, Sloan School of Management 


The criteria employed by engineers in the selection of various technical informa- 
tion channels in problem-solving endeavors are investigated. The relationships 
of certain criteria identified in past research to such factors as frequency of 
channel utilization and the rate at which engineers accept or reject technical 
information received from specific channels are the focal points of the study. 
A direct relationship is found between perceived accessibility of information 
channels and several objective measures of utilization, whereas no definite sup- 
port is found for the hypothesis that the channels perceived highest in technical 


quality are those used most frequently, 


Before information technologists are able 
to design effective systems to provide the 
scientist and engineer with the information 
necessary for his work, a great deal must be 
learned about the behavior of the user popu- 
lation. Such research is requisite both to 
assess needs and to improve predictions of 
the impact on the total information system 
of new components or modifications of exist- 
ing ones. 

As a result, a large number of what have 
come to be called “user studies” (investiga- 
tions of the information-seeking behavior of 
scientists and engineers) have been performed 
over the past 10 years. Such studies endeavor 
to determine (a) the effectiveness of cur- 
rently used information channels, or (b) the 
criteria governing the selection of information 
channels. Most such studies, until very re- 
cently, have not, however, differentiated these 
two goals. They have implicitly assumed 
channel effectiveness to be the single criterion 
governing channel selection. Operating from 
this assumption, it was necessary only to 
observe the users and to assign performance 
measures to the channels on the sole basis of 
the extent to which they are used. 

Allen (1966a, 1966b), applying an inde- 
pendent measure of effective channel per- 
formance, found no relation whatever between 
this measure and the extent to which each 
of eight channels were used, These results 

1 The research reported in this paper was supported 


by a grant from the Office of Science Information 
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counter any assumption of information qual- 
ity as the single criterion upon which channel 
selection is based. Allen proceeds to demon- 
strate the by now obvious fact that there 
exists in such a decision process a second 
parameter of considerable consequence. In 
addition to the value of a given channel, there 
is a certain cost associated with using it. The 
present study is directed at the measurement 
of relative costs associated with the use of 
each of nine information channels by R&D 
engineers and the relative weights assigned 
cost and value in this decision process (Gerst- 
berger, 1967). 

Cost in this sense is a rather complex con- 
cept. It may consist of many dimensions— 
economic, psychological, and physical. This 
study measures perceived cost to the user in 
terms of channel accessibility and ease of use. 
There is considerable support for the idea 
of measuring cost on these two dimensions. 
First, Allen’s (1966a) data led him to hy- 
pothesize that the frequency with which a 
channel is used is principally determined by 
the accessibility of the channel to the engi- 
neer seeking information. Second, both 
Werner (1965) and Rosenberg (1966) com 
clude that the behavior of the users they 
studied appears to be influenced by percep- 
tions of the ease with which an information 
channel might be used. 
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Massachusetts Institute of Technology 
Solution Development Record 
Name = Date: 
Problem: c= apie 
a. estimate of probabilit ‘that 
alternatives under consideration — will — oa 
circle a probability estimate for 
each alternative) 


. ee ee „ BoD o AB 


eS nnen e 


— 
= 
< 
— 
= 


ae ie ee 28. « os 56 & 9 if 


2 


O- nah eee eee 
(Circled Probabilities Should Add to 1.0) 


b. If technical information having an impact upon your visualization of the problem or any of its potential solutions 
was received at any time during the past week, please circle the source(s) of that information on the line below. 


Information Source(s): L V C ES TS CR G E D 


e. Please try to recall two instances during the past week when you were actively searching for technical informa- 
tion related to the problem under consideration. For each case, please fill in a description of the type of informa- 
tion you sought and indicate the search order of all information channels consulted even though some or possibly 
all yielded little or no relevant information. (Place a “1” in the space provided in front of the source you con- 
sulted first; place a “2” in front of the source you consulted second; and so forth.) : 


(1) Type of information sought: 


Search order: I. V. P ee 


(please specif; ry): — — 


(2) Type of information sought: 


Search order: VC ES < e aD ‘Other 


(please specify) 


Did you obtain the information sought in (1): MENE i. No in (2): — ves No 


Comments: 


Thank you! 
Fi. 1. The Solution Development Record. (This device 
and Development projects. Respondents report, 


vendors; C = customer; ES = eS} 
discussion; E = experimentation; D = other division. 
on the reverse side of this device.) 
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TABLE 1 


INFORMATION CHANNELS 


Channel 


Description 


L = literature 


V = vendors 
C = customer 
ES = external sources 


Books, professional, technical, and trade journals and other publicly accessible written 
material. 

Representatives of, or documentation generated by, suppliers or potential suppliers of 
design components. 

Representatives of, or documentation generated by, the government agency for which 
the project is performed. 

Sources outside the laboratory or organization which do not fall into any of the above 
three categories. These include paid and unpaid consultants and representatives of 
government agencies other than the customer agency. 

Engineers and scientists in the laboratory who are not assigned directly to the project 


Any other project performed previously or simultaneously in the laboratory or organiza- 


TS = technical staff 
under consideration. 
CR = company research 
G = group discussion 
group. 
E = experimentation 
D = other division 


tion regardless of its source of funding. This includes any unpublished documentation 
not publicly available, and summarizing past research and development activities. 
Ideas which are formulated as the result of discussion among the immediate project 


Ideas which are the result of test or experiment or mathematical simulation with no im- 
mediate input of information from any other source. 
Information obtained from another division of your company. 


initially selected; of these, 19 remained fully coopera- 
tive over the 15-wk. period of the study.? Of the 
19 cooperating engineers, 8 are employed in one di- 
vision, 11 in another. Those who are employed in 
the first division are further subdivided into those 
who work in a components laboratory and those who 
are with a systems laboratory. The two laboratories 
are separated by about 4 mi.; the second division is 
18 mi. away. The mean age of the 19 cooperating 
engineers is 33 yr. and the range 24—43 yr. Six have 
MS degrees; the remainder all hold BS degrees in 
engineering. They have been with their present com- 
pany an average of 54 yr. 

Data were gathered by means of a modified ver- 
sion of the Solution Development Record (Allen, 
1966c) (Figure 1). In addition to the questions about 
the probability of accepting alternative-solution can- 
didates to the S's problem and the use of specific 
information channels, each S was asked each week 
to report in more detail on two information searches. 
Specifically, Ss were asked to report the type of in- 
formation sought, the order in which information 
channels were approached, and an indication of the 
success of the search. 


2A comparison of the 19 cooperating engineers 
with the 14 noncooperative ones shows no reason to 
suspect that they differ in terms of any background 
characteristics or (from the evidence available) in 
terms of their information-gathering behavior. 

3 The two divisions operate independently of one 
another, and the nature of their business is quite dif- 
ferent. One division is in the missile systems business; 
the other is in the business of developing and pro- 
ducing electronic components. All relations reported 
in this paper, unless otherwise indicated, hold true 
for both divisions, when each is considered separately. 


The modified Solution Development Record was 
supplemented by periodic questionnaires in which 
each S was asked to rank nine information channels 
(Table 1) on the basis of the following four criteria: 

1. Accessibility—accessibility of the nine informa- 
tion channels (without giving consideration to the re- 
liability or quality of the information expected). 3 

2. Ease of Use—ease of use of the nine information 
channels listed. 

3. Technical Quality—technical quality or reliabil- 
ity of the information obtainable from each of the 
nine channels listed (without giving consideration to 
the accessibility of the channel). 

4. Degree of Experience—degree of experience the 
engineer has had with the nine channels during his 
career, 3 

Each of the four channel-ranking questionnaires 
was administered at a separate point in time over 
the course of the study. The minimum separation 
between questionnaires was 1 wk. Upon completion 
of the study, each S was questioned to determine 
whether he understood clearly the meaning of eac 
of the four dimensions. In a few cases, in which an 
S exhibited confusion over the meaning of a dimen” 
sion, this difficulty was corrected by the interviewe", 
and S was allowed to rerank the channels. x 

The resulting rank order of information channels 
for each criterion provides subjective estimates 0 
“channel cost” and “channel payoff,” or value. These 
channel rankings can be tested for correlation wit 
rank orders based on the frequency-of-use deter is 
from Solution Development Record data. This tech” 
nique makes possible the identification of the rel 
importance of various criteria in information-channe 
selection. The postproject interview was also use 
to determine the information sources which ha 
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brought to mind cach of 
considered by each engiorer 


Resutrs 


Seven hypotheses are tested in the study, 
All are concerned with predicting the criteria 
which determine both the frequency with 
which an information channel is used and the 
order in which it is consulted by an engineer 
requiring information; 

1. Those information channels which are 
considered more readily accessible will be 
used more frequently than less accessible 
channels. 

2. Those information channels which are 
considered easier to use will be used more 
frequently than less easy to use channels, 

3. Those information channels which are 
believed to provide information of higher 
quality will be used more frequently than 
those providing lower quality, less reliable 
information. 

4. Those information channels which are 
considered more readily accessible will be 
used first in the course of an active informa- 
tion search. 

5. Those information channels which are 
considered easier to use will be used first in 
the course of an active information search. 

6. Those information channels which are 
believed to provide information of higher 
quality or reliability will be used first in the 
course of an active information search. 

7. A greater proportion of ideas (measured 
by the ratio of messages accepted to those 
received) will be accepted from those in- 
formation channels which are believed to 
provide information of higher quality or 
reliability. 


the solution alternatives 


Frequency of Use 

Testing the first hypothesis, a strong rela- 
tion (Kendall Tau = .67, p .01) is found 
(Table 2) between channel accessibility and 
frequency of use. This first-order test pro- 
vides substantial support for the first hy- 
pothesis, Turning to the second criterion, 
ease of use, a somewhat weaker relationship 
(r= 44, p<.06) is found between the 
channel rankings. The application of partial 
correlation analysis (Table 3) to the data 
clearly reveals the magnitude and direction 
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of the relationship. Accessibility is definitely 
the dominant cost criterion determining the 
relative frequency with which information 
channels are used. When accessibility is held 
constant, ease of use actually shows a slightly 
negative relation with frequency of use. Simi- 
larly, in the case of perceived technical qual- 
ity, there is a weak positive relation with 
frequency of use, but when accessibility is 
controlled, the relation approaches zero, Ap- 
parently, in the minds of the Ss, there is some 
relation between their perceptions of tech- 


TABLE 3 


Retarion OF Tierz Serocnox CITTI to 
FREQUENCY oF Ixroxwation Croce Use 
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TABLE 4 


RELATION OF THREE SELECTION CRITERIA TO 
FREQUENCY OF CHANNEL SELECTION AS A 
First Source 


Correlation be- 
tween frequency | _ Kendall Variable 
. Kendall| Tau , 
of selection as jas partialed 
a first source 855 (par- out 
tialed) 
and 
Perceived ac- 
cessibility 87 57 Ease of use 
61 Technical 
quality 
Perceived ease 
of use 44* —.16 Accessibility 
31 Technical 
quality 
Perceived tech- 
nical quality 39 19 Accessibility 
2¹ Ease of use 
* 
ap Saat 


nical quality and channel accessibility, but 
it is the accessibility component which al- 
most exclusively determines frequency of use. 


First Sources 


Each S$ was asked once a week to report 
two instances during the preceding week in 
which he actively sought information. In each 
instance, Ss reported whether or not the 
search was “successful” and the order in 
which they contacted channels. There were 
154 searches reported. 

Table 4 presents the correlations between 
frequency of selection as a first source and the 
three criterion variables, Here again acces- 


TABLE 5 


MEDIAN RANK ORDERING OF INFORMATION CHANNELS 
ON THE Basis OF DEGREE OF EXPERIENCE 


Information channel Ranking assigned 


Literature 

Group 
Experimentation 
Technical staff 
Vendors 
Company research 
Customer 

Other division 
External sources 


O n hwrne 


Note. V= 19 R&D engineers. 


sibility appears as the dominant criterion 
upon which selection is based. Engineers turn 
first to that channel which is most accessible; 
perceived technical quality influences this 
decision only to a minor extent. 


Effect of Experience 


One would expect that the degree of experi- 


ence which an engineer has with a given 
information channel would influence his per- 
ception of both costs and value associated 
with that channel. He might never use the 
library, for example, because of a mispercep- 
tion that it contains nothing of value, or that 
it is really too difficult to find what you are 
looking for. Given some moderate amount of 
experience with library use, however, he may 


are more in accord with reality. Similarly 
with interpersonal channels, increased experi- 
ence should adjust perceived costs. The ad- 
justment can, of course, mean either an in- 
crease or decrease in perceived cost, and may 
be entirely random, increasing for some indi- 
viduals or channels and decreasing for others. 
Should the latter situation hold, correlation 
analysis will fail to detect any systematic 
trend, and a zero correlation will be found 
between experience and perceived cost. If, 
however, experience brings about a systematic 
shift (most Ss shifting in the same direction 
with respect to most channels), correlation 


TABLE 6 


CORRELATION AND PARTIAL CORRELATION Co- 
EFFICIENTS FOR RELATION BETWEEN DEGREE 
OF EXPERIENCE AND PERCEIVED Costs 
OF USING INFORMATION CHANNELS 


come to adjust these perceptions so that they 


Correlation be- Kendall able 
tween degree | Kendall] Tau bien ed 
of experience Tau (par- P ae 
and tialed) 
Accessibility woe 37 | Ease of use 
53 Frequency of use 
Ease of use 72 37 Accessibility 
60 Frequency of use 
Frequency of use .61** 25 Accessibility 
51 | Ease of use 
Technical quality | . 56“ 43 Accessibility 
32 Fase of use 
30 Frequency of use 
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analysis can detect this phenomenon and will 
show a positive or negative correlation be- 
tween experience and cost. 

Table 5 shows the nine channels arranged 
according to the amount of experience the 
engineers report having had with each of 
them. Table 6 demonstrates quite clearly that 
a strong positive relationship holds between 
the degree of experience an engineer has had 
with a given channel and both perceived 
accessibility and ease of use. 

Of course, the engineer may simply refer 
more frequently to those channels which he 
sees as more accessible or easier to use. In 
other words, the direction of causality may 
be from cost to experience. Lower-cost chan- 
nels are used more, and he thereby accumu- 
lates greater experience with them. To test 
this possibility and determine causal direc- 
tion, the frequency with which the engineers 
used the nine channels is controlled and the 
relation between experience and perceived 
cost again tested (Table 6). This reduces 
both of the original relations somewhat, but 
reasonably strong correlations remain, thus 
indicating that the degree of experience that 
an engineer acquires with an information 
channel does tend to lower his perception of 
the cost of using that channel. 


DISCUSSION 


Any assumption that engineers act in 
accord with a simple instrumental learning 
model in which they turn most frequently to 
those information channels which reward 
them most often should now clearly be laid 
to rest. Engineers, in selecting among infor- 
mation channels, act in a manner which is 
intended not to maximize gain, but rather to 
minimize loss. The loss to be minimized is 
the cost in terms of effort, either physical 
or psychological, which must be expended 
in order to gain access to an information 
channel. 

Their behavior thus appears to follow a 
“law of least effort” (see, eg, Zipf, 1949). 
According to this law, individuals, when 
choosing among several paths to a goal, will 
base their decision upon the single criterion 
of least average rate of probable work. In 
other words, to minimize his average rate of 
work expenditure over time, ... an indi- 
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vidual estimates the probable eventualities, 
and then select(s) a path of least 

a of work through these [Zipf, 1949, p. 
61.” 

In the selection of information channels, 
the engineers certainly appear governed by a 
principle closely related to this. They attempt 
to minimize effort in terms of the work re- 
quired to gain access to an information chan- 
nel. However, they either ignore probable 
future work (searching through other chan- 
nels, upon failure to find the needed informa- 
tion through the first channel), or they are 
unable to estimate it, If, in fact, engineers 
were to consider future effort in making their 
decision, one would expect to find a negative 
correlation between the mean number of 
channels which must be used to gain the 
desired information after selecting one chan- 
nel as the first source. In other words, for 
each time that a given channel is chosen as 
a first source, the number (including zero) of 
channels which had to be used before the 
information was in hand can be counted. The 
mean number of additional channels used can 
be computed for each channel. This mean 
score serves as an index of the long-term 
effort (over and above that effort connected 
with gaining access) associated with each 
information channel. That correlating this 
index with the frequency of use of each chan- 
nel produces a near-zero correlation indicates 
that engineers do not take this long-term 
effort into account in selecting channels. So 
we are left with engineers behaving according 
to a simplified version of the law of least 
effort, in which they take only their immedi- 
ately predictable effort into account and 
minimize that parameter in making their 
decision. 

The implications of this finding are very 
important. Improving the quality or perform- 
ance of a particular information service will 
not, in and of itself, lead to increased use 
of the service. More investment in library 
holdings, for example, will be wasted unless 
at the same time this material is made more 
accessible to the user. Engineers will simply 
not be attracted to the library by improve- 
ments in the quality or quantity of the 
material contained there. The library must, 
in a sense, come to them. 
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Those concerned with R&D management 
whether they are laboratory directors, librar- 
ians, or administrators of professional engi- 
neering societies have both dimensions of this 
problem, to varying degrees, within their con- 
trol. To some extent they can control the 
quality of the information which is available 
through certain channels. The laboratory di- 
rector, for example, can through hiring make 
available a more competent technical staff to 
improve the quality of internal consulting. 
The librarian can improve the extent and 
quality of the material contained in the 
library. The professional societies can through 
careful editing and review improve the qual- 
ity of their present publications or make new 
publications or services available to the mem- 
bership. Improvements in any or all of these 

directions may well be desirable. But it must 
be remembered that such improvement, by 
itself, will not guarantee any improvement in 
the performance of the organization or of the 
engineering community. Before the improved 
information service can lead to increased per- 
formance, it must be used! And the only way 
to increased use is through increased acces- 
sibility. Fortunately, however, this parameter 
can usually be quite readily controlled by 
management. The laboratory director must 
not just hire more competent technical staff, 
he must encourage their use as consultants 
by rewarding them for this function, and by 
encouraging his engineers to go to them for 
information. He must make it known that 
no one will downgrade a man for seeking 
help and take steps to reduce the psychologi- 
cal risk and cost in internal consulting. The 
librarian must make his services more readily 
available. This may involve some rather ex- 
pensive additions to personnel (library assist- 
ants specializing in particular technical areas, 
special messenger services, mobile libraries, 
etc.). But unless this additional investment is 
made, the money spent on library holdings 
and facilities will be wasted. The engineering 
societies, too, can make their services more 
accessible. Many such improvements, as for 
example selective dissemination, demand de- 
posit, and the publication of critical annual 
reviews with extensive reference lists, have 
been discussed in recent years and many of 
these hold great promise. But there is a more 
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important step which the societies might take 
to make their material more accessible. The 
principal reason for the reluctance of engi- 
neers to use their professional literature js 
that, for the most part, they cannot under- 
stand it. Most of the professional engineering 
literature is too mathematically sophisticated 
for the average engineer to comprehend. It is 
therefore inaccessible to him. Now if one 
wishes to bring engineers into greater contact 
with their literature, there are two obvious 
approaches. The first is to operate on the 
engineer and enhance his mathematical so- 
phistication; equip him so that he can under- 
stand the literature as it now exists. The 
second and perhaps more tractable approach 
is to bring the literature to the engineer. The 


Professional societies should publish a litera- 


ture form, whose technical content is reason- 
ably high, but which is understandable by the 
audience to whom it is directed. This is not 
an easy task. And it does not necessarily im- 
ply that the societies forego all of their pres- 
ent program of publications. What it does 
mean is that these publications are at least 
supplemented by a series of more readable 
works. That this can be done is amply demon- 
strated by the success of a large number of 
profit-making, controlled circulation publica- 
tions, which engineers do read (Allen, 1966a). 
The engineering societies should give up their 
style of emulating the scientific societies. 
Their goals are different. There is nothing 
wrong with publishing nonoriginal contribu- 
tions in engineering. The goal of engineering 
publication is to inform, not to stake out 
claims. Only when the engineering societies 
come to realize this will they serve their 
audience properly. 

Furthermore, since engineers perceive chan- 
nels to be more accessible as they gain greater 
experience in using them, any steps to in- 
crease use through improving accessibility 
will be self-reinforcing. As laboratory man- 
agers make their internal consultants more 
accessible, they will come to be used more 
and the engineers’ perception of their acces- 
sibility will be further enhanced. As engineers 
gain more experience with the more accessib e 
professional literature, their perception of in- 
creased accessibility may generalize to the 
more sophisticated literature and encourage 
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them to work toward improving their ability 
to comprehend this literature. 

Engineers, it is true, do compensate, by 
varying the degree of skepticism with which 
they filter the ideas received, for their lack 
of attention to quality when selecting an 
information channel. This is an interesting 
phenomenon in which technical quality is 
recognized, but its consideration is delayed 
until after the channel has been selected. 
Accepting or rejecting messages is a process 
which takes place over a considerable period 
of time. The focus during this time is tech- 
nical quality (including, of course, cost and 
hardware availability). The alternatives under 
consideration are quite rigorously compared 
in terms of the quality dimension. It is no 
wonder then that the perceived reliability of 
the source is taken into account, and that 
ideas are accepted or rejected, at least in 
part, on that basis. 

From a simple engineering viewpoint, this 
seems to be an appallingly inefficient way of 
doing things: using the wrong criterion early, 
and later correcting for it. This is one more 
reason why steps must be taken to aid the 
engineer in overcoming the channel selection 
problem. Many of the suggestions made ear- 
lier in this section should more than pay for 
themselves in terms of engineering effort 
conserved. ` 


CONCLUSIONS 


Empirical support is found for the model 
of the information-channel selection process 
first proposed by Allen (1966a). In this 
process, channel selection is based almost 
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solely on channel accessibility, the most 
accessible channels being used frst. 

The following specific conclusions are de- 
rived from the study: 

1, Accessibility is the single most impor- 
tant determinant of the overall extent to 
which an information channel is used. 

2. Both accessibility and perceived tech- 
nical quality influence the choice of first 
source, 

3. Perception of accessibility is influenced 
by experience. The more experience an engi- 
neer has with a channel, the more accessible 
he perceives it to be. 
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JOB PREFERENCES AND PREFERENCE SHIFTS AS FUNCTIONS 
OF JOB INFORMATION, FAMILIARITY, 
AND PRESTIGE LEVEL 


RALPH D. NORMAN anp DAVID W. BESSEMER 


University of New Mexico 


Preference ratings of 10 familiar professional, 10 familiar skilled, 10 unfamiliar 
professional, and 10 unfamiliar skilled labor jobs, presented as either titles or 
descriptions, were obtained from 100 male high school students. Professional or 
familiar jobs were preferred as titles but these effects were absent or reduced 
in ratings of descriptions of the same jobs. Changes in job preferences in a 
2nd rating were greater when Ss were shifted from titles to descriptions, or 
the reverse, than when the identical type of material was rated, indicating non- 
equivalence between stimulus materials in study of job attitudes. Implications 
for personality and interest tests using job titles as items are discussed. 


In interest and personality tests such as the 
Vocational Preference Inventory (Holland, 
1965), Minnesota Vocational Interest Inven- 
tory (Clark, 1965), and the Strong Vocational 
Interest Blank (Strong, Campbell, Berdie, & 
Clark, 1966), the S states his occupational 
preferences. Frequently, however, only job 
titles are presented to § without further in- 
formation. Quite some time ago, several stud- 
ies (e.g., Bateman & Remmers, 1939; Nick, 
1942; Recktenwald, 1946, 1948) demon- 
strated the effect of information presented as 
lectures, discussion, reading, etc., on job-atti- 
tude change. Hoppock (1967) argues that 
providing individuals with such information 
assumes that attitudes are affected by what 
they know about vocations. He contends that 
simple job titles afford only limited cues, and 
suggests guidelines for providing occupational 
information, including title, description, quali- 
fications, demands, etc. However, these guide- 
lines are obviously uneconomical when more 
than a few jobs are to be presented for atti- 
tudinal evaluation, and thus have not been 
used to construct materials for interest tests 
or job-preference studies. 

Information and one of its resultants, fa- 
miliarity, are not the only variables affecting 
attitudes toward job titles. Using a few limited 
titles, Osgood and Stagner (1941) found that 
occupational stereotypes conform closely to a 
framework based on prestige, the latter being 
imputed to titles per se on the basis of brains, 
being noticed, financial return, excitingness, 
etc. Moreover, although not immediately rele- 


vant to the present study, recent factor-an- 
alytic research using titles alone disclosed 
differences among them on five major factors, 
which in one study (Gonyea, 1961) failed to 
resemble conventional interest groupings or 
job classifications, but which in a second in- 
vestigation (Gonyea & Lunneborg, 1963) with 
different methodology resembled a classifica- 
tion proposed by Holland (1959). In a study 
designed to examine the role played by job 
descriptions in interaction with titles, Osipow 
(1962), using the semantic differential, found 
differences in attitude toward differing job 
titles (Janitor or Building Superintendent) for 
the same job on evaluative and potency di- 
mensions which disappeared when descriptions 
were given. Osipow suggests that perceptions 
of jobs may be changed with minimal descrip- 
tions. 

Attitudes toward job titles (vocational 
stereotypes) have been employed by Holland 
as the basis of a personality inventory (Hol- 
land, 1958) and as a means of developing 4 
theory of personality types based in part on 
vocational choice. Holland (1966) in fact 
states: 


The assumption that vocational stereotypes have re⸗ 
liable psychological and sociological meanings is im- 
portant because most interest inventories rest heavily 
on its validity, If vocational stereotypes were unre- 
liable and inaccurate, the validity of interest inven- 
tories would be seriously reduced. In the case of the 
present theory, the validity of this assumption lends 
credence to the first assumption—vocational choice 
is an expression of personality—and to several sub- 
sequent assumptions [p. 5]. 
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The recent work with job titles invites fur- 
ther investigation of variables affecting atti- 
tudes toward them, such as familiarity, pres- 
tige level, and information provided by job 
descriptions. Regarding the latter, it was felt 
that somewhere between the completely re- 
stricted cues of titles alone and the detailed 
guidelines suggested by Hoppock (1967) 
should lie brief but informative kinds of occu- 
pational materials: job descriptions condensed 
from the Dictionary of Occupational Titles 
(DOT) (United States Employment Service, 
1949).* Such descriptions should serve to con- 
vey the general “flavor” and essential details 
of job duties, without being elaborate and 
time consuming. They should also be particu- 
larly useful when job titles are unusual or 
misleading, and should tend to reduce the in- 
fluence of prestige level and occupational 
stereotypes on measures of job attitudes and 
preferences, as Osipow (1962) clearly demon- 
strated. 

The present study was designed principally 
to compare titles and descriptions in terms of 
the influences of job familiarity and prestige 
level on occupational preference ratings. In 
addition, results were obtained concerning the 
stability and equivalence of ratings of these 
two kinds of materials. 


METHOD 
Materials 


All jobs in the DOT 0 code (professional) and 
DOT 4 and 5 codes (skilled labor) were scanned, in 
order to select, on an a priori basis, familiar or un- 
familiar jobs in each category. An attempt was made 
to match at each code level a familiar with one or 
more quite unfamiliar jobs in the same “ob family.” 
For example, Electrician, a skilled familiar job, was 
paired with Boardman, an unfamiliar job in the 
same three-digit group. Sixty-eight titles were thus 
selected, representing a wide range of occupations, 
and preliminary ratings of familiarity were obtained 
from small groups of college freshmen (us of 16 
and 17). Inspection of mean ratings caused attrition 
of 20 titles because no matched job was sharply 
discriminated on familiarity. The preliminary ratings 
were also used to determine which matched job 
offered the greatest familiarity difference for each 
original title (eg, Controller was much more un- 
familiar than Comptroller as a match for Account- 


1The 1949 edition of the DOT rather than the 
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sure completely unfamiliar titles, 
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ant). The remaining 48 jobs, half professional and 
half skilled, were then submitted by title to another 
group of freshmen (n = 116) who again rated these 
jobs on a 7-point scale as to their degree of famili- 
arity with them. By means of the sign test (Siegel, 
1956), 40 titles were then selected which were sharply 
discriminated as to familiarity. These 40 jobs, evenly 
divided into 10 professional familiar (PF), 10 pro- 
fessional unfamiliar (PU), 10 skilled familiar (SF), 
and 10 skilled unfamiliar (SU), their DOT codes, 
and results of the sign test, are presented in Table 1. 
In all except 1 of the 40 jobs, the match of familiar 
with unfamiliar occurred exactly for the first three 
digits of the DOT code, revealing the “job family” 
closeness of the 20 pairs. 

In order to have a well-controlled stimulus for 
job descriptions, each of the longer DOT descriptions 
of these 40 jobs was condensed to a length of 20-30 
words. Mean length was 22.5 words for professional 
jobs and 23.1 for skilled. In rewriting descriptions, the 
title of the job was, of course, scrupulously avoided. 
Thus, the two types of presentation in the main ex- 
periment had one which gave the title and no other 
cues, and one which gave a brief description of the 
job duties but no title. 


Subjects and Design 


Four groups of volunteer Ss rated the 40 occupa- 
tions on two separate occasions spaced 1 mo. apart. 


The groups differed only in the type of material, 
that is, titles or descriptions, rated in each session: 
Group I, Title-Title (TT); Group II, Description- 
Title (DT); Group III, Title-Description (TD); 
Group IV, Description-Description (DD). The basic 
plan was thus a 2 X 2 crossover with respect to type 
of material and order of presentation, with additional 
repeated measured factors of familiarity (familiar 
versus unfamiliar jobs), and prestige level (proſes- 
sional versus skilled jobs) as described above. 

A total of 240 Ss, all male high school seniors, 
had been randomly preassigned with 60 in each 
group. At the first session, 224 Ss appeared, with 
slightly uneven amounts in each group. For adminis- 
trative reasons, Ss were tested as a single group in 
both rating sessions conducted in the school cafeteria 
during specially assigned periods. They were aware 
only of being assigned to one of four groups (not 
the rationale of the groups) who were given an ap- 
propriate booklet in which titles or descriptions, as 
the case might be, were arranged in random order. 
The Ss were instructed to rate the stimuli on a 5- 
point Likert-type scale from 1 (like very much) to 
5 (dislike very much). In order to stimulate interest 
prior to rating, they were asked to indicate, in addi- 
tion to their names and ages, their father’s job, own 


TABLE 2 


MEAN PREFERENCE RATINGS OF JOB TITLES 
AND DESCRIPTIONS 


First rating | Second rating 
Type of Job M SD M | SD 
material | class i 
Groups TT Groups TT 
&TD &DT 
Titles PF | 3.00 72 3.03 | .76 
PU | 3.45 66 3.50 | .72 
SF | 3.43 75 3.43 74 
SU | 3.70 69 3.76 | .69 
Groups DT Groups TD 
& DD & DD 
Descriptions | PF | 3.16 59 3.09 -70 
PU 3.50 53 3.40 68 
SF 3.24 62 3.37 61 
SU 3.42 58 3.51 57 
Total PF | 3.08 66 3.06 73 
PU | 3.47 60 3.48 70 
SF | 3.34 69 3.40 68 
SU | 3.56 64 3.64 63 


Note.—Abbreviations: PF = professional familiar; PU 
= professional unfamiliar; SF = skilled familiar; SU = skilled 
unfamiliar. TT = Title-Title; TD = Title—Description; DT 
= Description-Title; DD = Description—Description. 
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goal job, and college-attendance intent. They were 
also told that Es were interested in their attitudes 
toward certain jobs, that their cooperation would 
aid research in this area, and to do their best in rat- 
ing all stimuli presented to them. The Ss were not 
informed of the type of material rated by other Ss. 
Of the number who participated in the first session, 
100 failed to return for the second rating (50 from 
the “title” condition and 50 from the “description” 
condition) but no significant differences were ob- 
served in the first session data between second session 
returnees and nonreturnees. An additional 24 Ss were 
randomly eliminated to bring the n of each group 
to equality at 25 Ss. 


RESULTS 
Job Preferences 


The mean ratings for each job class and 
type of material are shown in Table 2. There 
were no statistically reliable effects attributa- 
ble to order of presentation, and therefore the 
data of the groups rating titles and of those 
rating descriptions, respectively, were com- 
bined for the first and second ratings, as the 
case might be. 

The ratings of titles consistently showed 
professional to be preferred to skilled labor 
jobs, and familiar to be preferred to unfa- 
miliar jobs. These differences, however, ap- 
peared to be considerably reduced in the 
ratings of descriptions. When titles alone were 
rated, the difference between means of pro- 
fessional and skilled jobs was .34; the same 
difference for descriptions was .08. Again, 
when titles alone were rated, the difference 
between means of familiar and unfamiliar jobs 
was .38; the same difference for descriptions 
was .26. Analysis of variance of the data of 
the first rating showed significant effects of 
familiarity (F = 67.5, df = 1/96, p < 001 
and prestige level (F = 7.1, df =1/96, P< 
.01). Analysis of variance of the second rat- 
ings resulted in similar significant effects: fa- 
miliarity, F = 111.6, df=1/96, p< 001; 
prestige level, F = 10.8, df = 1/96, p < 01; 
but additionally there was a significant inter- 
action between these variables (F = 10.6; 
df = 1/96, p < .01). It should be noted that 
no particular meaning can be attached to this 
interaction since the familiar and unfamiliar 
jobs were not selected to insure an equal dif- 
ference in familiarity within skilled and pro- 
fessional categories, respectively. While the 
familiarity effect is somewhat larger for Pro- 
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rating (F = 1.3, dj = 1/96, p> .20). A sig- — 
nificant interaction between type of 1 ah 
and familiarity was obtained in the latter an- EU 
alysis (F = 5.7, dj = 1/96, p < .0S), appar- 

ently reflecting the reduction in the difference were obviously preferred to unfamiliar 
between familiar and unfamiliar jobs found regardless of the occupational 
for ratings of descriptions as compared to in both the first and second 
those of titles. This reduction must be attrib- would be comprehensible in the ti 
uted largely to the ratings of skilled jobs pre- where the cue was absolutely minimal, 
sented as descriptions, although the three-way one word. However, it is interesting 
interaction was not statistically significant 
(F = 23, df = 1/96, p > .20). familiar,” was found in the description 


Prejerence Shifts elaboration. The only effect of cue form 
Table 3 shows the mean absolute change in familiarity was to reduce , but 
job preferences obtained between the first and eliminate, its effect in the second rating. There 
second ratings. Since no statistically signifi- are two possible reasons for the effect of per- 
cant differences were obtained between the sistence of the effect of familiarity despite 
two groups rating the same type of material the additional information provided by the 
on both occasions, or between the two groups description. On the one hand, there would 
rating different types of material, the data seem to be a tendency for job 
were combined for groups having constant and familiar jobs to specify relatively 

changed materials, respectively. tivities with the result that 
The preference shifts were clearly larger descriptions to those of the 
when the cue form was changed between the (This hypothesis is offered despite 
two ratings, as might be expected, but the that in the descriptions the words per 
effect was more pronounced for unfamiliar all within the vocabulary level of high 
than for familiar jobs. An analysis of variance seniors.) On the other hand, inspection 
confirmed the reliability of these effects, show- 


ing significance for change in materials (F = 5 
15.5, df = 1/96, p < 001) and familiarity level, involved anes yor general 
(F = 12.5, df = 1/96, p < 001), and inter- whereas those o unfamiliar ones 2 
action between these variables (F = 12.0, to specify drag oe specialized 
df= 1/96, p<.001). No other significant ties, In essence 7 — — 
effects were obtained in the analysis of the less opportunity for i = an ng 
shift scores element to Ss such as those of this study. 
1 Regarding prestige level, results showed 
DISCUSSION that job descriptions were af i s in dimi, 
Looking at the absolute preference ratings nating biases in favor of professi 
first, Table 2 data show that familiar jobs found when titles alone were rated. The means 
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for the first rating were: titles, Professional 
(P) = 3.23, Skilled (S) = 3.57; descriptions, 
P = 3.33, S = 3.33. For the second rating the 
means were: titles, P = 3.27, S = 3.60; de- 
scriptions, P = 3.28, S = 3.44. While the pres- 
tige level effect was not entirely eliminated 
by the presentation of descriptions in the sec- 
ond rating as it was in the first, it is the data 
of the first rating that would appear to have 
the most relevance to any application of these 
findings to interest inventories, since such tests 
are usually taken only once. To summarize 
here briefly the two main findings, it might 
be said that job description was more effec- 
tive in eliminating biases toward prestige level 
than it was in eradicating the effects of fa- 
miliarity. 

Turning attention now to shift scores, it is 
obvious that the degree of intersession varia- 
tion in ratings with the type of material un- 
changed was comparable across occupational 
levels and degree of familiarity. Additionally, 
the finding of equal shifts for the TT and DD 
groups in the preliminary analyses indicates 
that ratings of titles and descriptions were 
equally stable over the interval used here. 
There was clearly less consistency in ratings 
with changed cue form, indicating that Ss 
generally did not rate titles and descriptions 
in a similar fashion. This effect was much 
more pronounced for unfamiliar than for fa- 
miliar jobs. In fact, it did not seem to appear 
for the PF jobs (perhaps not quite as ob- 
viously so for SF jobs), indicating that S rec- 
ognized these even in description form. The 
implication of this particular finding is that 
additional information provided by job de- 
scriptions does not compensate for the effects 
of familiarity very well in terms of mean rat- 
ings. It is clear that information influences 
ratings much more for unfamiliar than for fa- 
miliar jobs, a situation which would indicate 
that descriptions are more valid indicators of 
preferences than titles for unfamiliar jobs. 
On the other hand, either titles or descrip- 
tions are relatively interchangeable as meas- 
ures for jobs familiar to S. 

The present study, while admittedly op- 
erating with a model of extremes (familiarity 
or unfamiliarity, title or description, higher 
or lower prestige level), has some implications 
for interest inventory item construction, as 
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well as for Holland’s (1966) work, which 
makes extensive use of occupational titles, 
Regarding interest inventory items, the argu- 
ment by Kuder (1966) concerning his new 
Occupational Interest Survey that “occupa- 
tional titles should be avoided in order to keep 
obvious vocational significance to a minimum” 
might carry more weight in view of the find- 
ings of this study. As is well known, many of 
the Kuder items, which involve activities, are 
in essence exceedingly brief occupational de- 
scriptions. Different item formats may thus 
partially account for reduced correlations be- 
tween inventories (e.g., Strong versus Kuder) 
exploring similar vocational interest domains. 
Fairly recently, King, Norrell, and Powers 
(1963) studied the relationships between 
‘identical twin” scales (those on the Strong 
and Kuder bearing the same name) and “fra- 
ternal twin” (similar but not identical) scales 
and found generally that even r’s for “iden- 
tical twin” scales were rather low, the average 
coefficient being .370. 

Concerning Holland’s research, since titles 
alone are apparently sensitive to other varia- 
bles, such as prestige and familiarity, varia- 
tions in S’s choices (and consequent personal- 
ity typing) may be influenced by these. For 
example, in the latest revision of his Voca- 
tional Preference Inventory (Holland, 1965) 
some relatively “unfamiliar” job titles are 
used, such as Meteorologist, Quality Control 
Expert, Manufacturer’s Representative, Traf- 
fic Manager, Inventory Controller, etc. His 
titles also vary widely over prestige levels. 
Probably the most prestigious of his six voca- 
tional-personality types is the Intellectual, 
and the least prestigious is the Realistic (Hol- 
land, 1966). Many of the job-title preferences 
of the former closely resemble the PF cate- 
gory in this study; those of the latter resemble 
the SF category. Whereas the basic validity 
of his theory of vocational choice is not ques- 
tioned, it would be valuable to examine the 
160 Vocational Preference Inventory titles for 
significant preference shifts, if any, with job 
descriptions. 
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NEED TYPE AND JOB SATISFACTION AMONG 
INDUSTRIAL SCIENTISTS * 


GEORGE B. GRAEN,? RENE V. DAWIS, ann DAVID J. WEISS 


University of Minnesota 


Need-fulfillment theories of job satisfaction generally assume that individuals 
differ in the outcomes they prefer (need) to obtain from their jobs. Moreover, 
these theories hypothesize that the relationship between the outcomes received 
on the job and satisfaction is dependent upon these preferences (needs). The 
hypothesis this study investigated is that the pattern of preferences for job 
outcomes moderates the relationship between preference for an outcome and 
satisfaction with that outcome. Questionnaires were employed to assess prefer- 
ence and satisfaction on 30 different job outcomes. Ss were 113 industrial re- 
search scientists working in 1 facility, Preference (need) ratings were subjected 
to Q cluster analysis. 2 “need” types were identified. It was found that need 
type moderated the relationship between preference (need) for an outcome 
and satisfaction with that outcome only on those outcomes most closely related 
to the company itself. A hypothesis was proposed that these 2 need types 
Teflect the 2 types of research and development personnel that differ primarily 
in orientation toward the company itself. 


The significance of individual differences in 
motivational variables for the prediction of 
job satisfaction has been emphasized by sev- 
eral writers (Morse, 1953; Schaffer, 1953; 
Vroom, 1960, 1964). The rationale commonly 
associated with such predictions is the need- 
fulfillment model, which states that job satis- 
faction is a function of the degree to which 
needs are met (“fulfilled”) by the work en- 
vironment. According to this model, two in- 
dividuals with different sets of needs, even 
though working in the same jobs, with similar 
working conditions and similar compensation, 
would not necessarily exhibit the same level 
of job satisfaction. 

In a more recent statement of the need-ful- 
fillment model, the Theory of Work Adjust- 
ment (Dawis, England, & Lofquist, 1964) 
hypothesizes that job satisfaction is a function 
of the correspondence between the needs of 
the individual and the reinforcer (i.e., need- 
fulfillment) system of the job. In common 
with earlier need-fulfillment models, this the- 
ory places major emphasis on individual dif- 
ferences in needs, but leaves to empirical re- 

This study was supported in part by Research 
Grant RD-1613-G from the Vocational Rehabilita- 
tion Administration, United States Department of 
Health, Education and Welfare, Washington, D. C. 

2Now at the University of Illinois, Urbana. The 
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search the task of specifying the nature of 
the relationship between needs and reinforcer 
systems which is predictive of job satisfaction. 

In the present study, a moderator variable 
is hypothesized as affecting the relationship 
between need-reinforcer correspondence and 
job satisfaction. The “need-type hypothesis” 
proposes that the pattern of needs of the in- 
dividual is an important moderator variable 
in the prediction of job satisfaction. More 
specifically, the hypothesis states that different 
degrees of accuracy in predicting satisfaction 
will be found for groups with different need 
patterns, that is, for different “need types.” 
The need-type hypothesis therefore assumes 
that there exist different need types, that is, 
individuals with distinctive need patterns. 
These need types are seen to differ mainly in 
the way their needs are interrelated. Thus, 
need type may be conceptualized as needs 
organized in complex interaction. 


METHOD 


To test the need-type hypothesis, Ss must be 
working on jobs which have equivalent reinforcer 
systems. Industrial research scientiste working in re- 
search and development for one company and en 
one location may be assumed to meet this require- 
ment. A measure of work-relevant needs also is re- 
quired for a test of the hypothesis. The Minnesota 
Importance Questionnaire (Weiss, Dawis, England, 
& Lofquist, 1964b), an instrument developed to 
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the study employed the subtractive model, which 
assumes that the more completely the need is “ful- 
filled,” the greater the satisfaction; that is 


fulfillment (reinforcement) of 
satisfaction. Evidence for the moderating effect of 
need type would be found in differences in the need- 
satisfaction relationship for different need-type groupe 
or, in terms of analysis of variance, an Interaction 
effect for need-type group and need level on the de- 
pendent variable of satisfaction, 

The Ss were 113 industrial scientists working in re- 
search and development for a large company. All 
worked in the same organizational unit. Median age 
was 34 yr. and median tenure with the company 
was 4.3 yr. 

The instruments employed were the 1966 revisions 
of the Minnesota Importance Questionnaire (MIQ) 
and the Minnesota Satisfaction (MSQ). 
The MIQ consisted of 117 Likert-type items and was 
scored on 30 scales. Respondents were asked to rate 
the importance of various aspects of work to their 
idea of an ideal job. For each item, the response 
choices were “not important,” “only slightly im- 
portant,” “important,” “very important,” and “ex- 
tremely important.” The response choices were 
scored 1 through $, respectively. Validity evidence 
for the MIQ as a measure of needs is reported in 
the Minnesota Studies in Vocational Rehabilitation 
series (Weiss, Dawis, England, & Lofquist, 1964a, 
1964b, 1965). 

The MSQ consisted of 150 Likert-type items and 
was scored on 30 scales representing the same rein- 
forcement dimensions utilized for the MIO.“ Respon- 
dents rated the degree of satisfaction they felt with 
various aspects of their job. For each item, the re- 
sponse choices were “not satisfied,” “only slightly 
satisfied,” “satisfied,” “very satisfied,” and “extremely 
satisfied.” These were scored 1 through 5, respectively. 
The MIQ and the MSQ were to the Ss 
in the work setting during working hours. To en- 
courage frankness, the identity of the respondents 
was kept anonymous. 

The MIQ scale scores were transformed into T 
scores with a mean of 50 anda standard deviation of 
10 to equate scale means and variances. Q correla- 
tions between each individual and all other indi- 
viduals were computed for the 113 individuals over 
the 30 T-scored MIQ scales. To identify homogencous 
(need type) groups the 113 X 113 Q-correlation 
matrix was cluster analyzed using Tryon's procedure 
(Fruchter, 1954). 


3A table giving reliability coefficients of the MIQ 
and MSQ scales has been deposited with the Ameri- 
can Documentation Institute. Order Document No. 
9950 from ADI Ausiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washington, 
D. C. 20540. Remit in advance $1.25 for photocopies 
or $1.25 for microfilm and make checks payable to: 
Chief, Photoduplication Service, Library of Congress. 
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ous (need · type) group. It was called the “in” 
cluster, All 81 Ss not included in the in cluster 
represented a heterogeneous need group which 
was called the “out” cluster. 

Of the 30 two-way analyses of variance, 8 
yielded significant interactions between need 
group and need (MIQ), with satisfaction 
(MSQ) as the dependent variable. Table 1 
shows the results of these eight ANOVAs. 
Only on the Security scale was there a sig- 
nificant need - group mean difference in satis- 
faction score. 

The three largest interaction F ratios were 
found on the dimensions of Working Condi- 
tions, Security, and Company Goals. Other 
scales with significant interactions were Au- 
thority, Company Image, Company Policies 
and Practices, Supervision-Human Relations, 
and Social Status. 

The interaction effect is apparent from a 
comparison of the means of the four cells. In 
the case of the in cluster, the high-need group 
showed higher satisfaction than the low- 
need group on all eight dimensions. In con- 
trast, for the out cluster, with one exception, 
the high-need group showed lower satisfaction 
than the low-need group on these dimensions. 


TABLE 1 


SATISFACTION MEANS FOR THE EIGHT DIMENSIONS 
WITH SIGNIFICANT INTERACTIONS 


In“ cluster | “Out” cluster 
Scale 
Low | High | Low | High 
need | need | need | need 
Authority 14.36 | 15.78 | 15.35 | 14.30* 
Company goals 12.13 | 14.19 | 13.29 | 11.16** 
Company image 13.57 | 16.78 | 14.86 | 15.68* 
Company policies and | 13.90 | 15.83 | 14.64 | 13.57* 
practices 
Security 14.39 | 17.16 | 15.02 | 14.11** 
Social status 14.36 | 15.14 | 15.57 | 14.15* 
Supervision-human 15.47 | 17.60 | 16.56 | 15.05* 
relations 
Working conditions 15.92 | 19.26 | 18.75 | 17.18** 


Note.—N = 113, SIN 
* Interaction between need group and need level significant 
at p < .05, 


Interaction significant at p < .01. 


Correlations between MIQ scores (on a 
continuous scale) and MSQ scores on the 
corresponding dimension, for the eight dimen- 
sions with significant interactions, are shown 
in Table 2. Correlations for the total sample 
ranged from —.20 to .34. These two extreme 
values were the only correlation coefficients 
differing significantly from zero. The correla- 
tions for the in cluster ranged from .11 to .60 
with four of the eight coefficients differing 
significantly from zero. In contrast, the cor- 
relations for the out cluster ranged from —.34 
to .23, with four of these being significant. 
The significant correlations were positive for 
the in cluster and, with one exception, nega- 
tive for the out cluster. Fisher’s Z transforma- 
tion was employed to test the differences in 
correlations between the in cluster and the 
out cluster. Significant differences were found 
on seven of the eight dimensions, Only the 
correlation difference for the Supervision-Hu- 
man Relations dimension was found to be non- 
significant. The largest difference between cor- 
relations was on the Security dimension, which 
was .60 for the in cluster and —.16 for the 
out cluster. 

An analysis of demographic variables for 
the two clusters revealed no differences be- 
tween the groups in age, education, tenure on 
the job, tenure with the company, time in 
the profession, job classification, or work- 
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group membership. Similarly, no differences 
between the means or variances of the in 
cluster and the out cluster were found on any 
of the MIQ scales. 


Discussion 


The hypothesis that need type moderates 
the prediction of job satisfaction finds sup- 
port in this study. For the present group of 
industrial scientists working in research and 
development for one company, there appears 
to be at least two need groups for which the 
relationship between need level and satisfac- 
tion on several dimensions is quite different. 
The improvement in the accuracy of pre- 
dicting satisfaction contributed by the need- 
type variable can be seen in Table 2. The 
correlations between corresponding need and 
satisfaction dimensions for the entire sample 
masked the correlations within each of the 
need groups. For instance, the correlation for 
the entire sample on the dimension of Security 
was .03 which masked correlations of .60 for 
the in cluster and —.16 for the out cluster. 
Similarly on Working Conditions, the overall 
correlation was —.01, but for the in cluster 
and out cluster, respectively, the correlations 
were .48 and —.20. Thus the moderating ef- 
fect of the need-type variable in this study is 
evident. 

It is of some interest to inquire into the 
nature of the two need groups. The in 
cluster and the out cluster identified in this 


TABLE 2 


CORRELATION BETWEEN NEED LEVEL AND SATISFAC- 
TION ON THE EIGHT DIMENSIONS WITH 
SIGNIFICANT INTERACTIONS 


Need group 
Total 
Scale group A 2 
(N = 113) "Taa “Out 
cluster cluster 
(N = 32) | (N = 81) 
5* 
Authority —.0¹ 25 —.23* 2.15% 
Company goals —.20* 11 54 a 
Company image 34h 50 %% 25 3.055% 
Ooy policies —.06 43% —.20 3. 
and practices be 
Security +03 00 be. = 16, 7415 
Social status =.16 20 —.27 1.59 
e 00 +25 =.09 . 
uman relations ak 
Working —.01 48 46 —.20 3.34 
conditions 


* Z values on Fisher's test for the difference between in 
cluster and out“ cluster correlation coefficients. 
* p< .05, 
wt» S 01. 
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study differed in need - satisfaction relation- 
ships on the dimensions of Authority, Com- 
pany Goals, Company Image, Company 
Policies and Practices, Supervision~Human 
Relations, Security, Social Status, and Work- 
ing Conditions. These are, for the most part, 
the dimensions most closely associated with 
satisfaction with the company itself. Among 
the 30 dimensions included in this study, the 
only “company” related dimension on which 
a significant interaction effect was not found 
was that of Company Prestige. Yet, even on 
this dimension the correlations between need 
and satisfaction were in a similar direction, 
being .15 for the in cluster and —.19 for the 
out cluster. 

These findings suggest that the two need 
groups might differ in their orientation toward 
the company. The findings might be inter- 
preted from the viewpoint of the “profes- 
sionally oriented” and “company oriented” 
dichotomy for research and development per- 
sonnel proposed by Schultz (1964). According 
to Schultz, the professionally oriented view 
the company merely as a means to achieve 
their broader professional goals and rewards, 
whereas the company oriented view the com- 
pany as the primary source of their achieve- 
ments and rewards. The in cluster in this 
study might correspond to the company- 
oriented type, viewing the company as the 
major source of need fulfillment. Thus, on 
the “company” dimensions their satisfaction 
increases and decreases with need level. In 
contrast, the out cluster might correspond to 
the professionally oriented type, or, at least 
it is not a company-oriented type. Its mem- 
bers apparently do not perceive the company 
as providing fulfillment of their needs. Thus, 
the higher their need level, the lower their 
satisfaction, These data seemingly support the 
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BRAND AWARENESS: 


EFFECTS OF FITTINGNESS, MEANINGFULNESS, AND 
PRODUCT UTILITY? 


RABINDRA 


N. KANUNGO 


Dalhousie University, Halifax, Nova Scotia 


Comparative influences of fittingness and meaningfulness of the brand name and 


the utility of the product represented 


by the brand name on brand awareness 


were studied. An operational definition of fittingness and a method of selecting 
fitting brand names were proposed. 24 male and 24 female Ss Ist rated brand 


names from advertisements and then 


recalled the brand names. These adver- 


tisements included 3 levels of product utility, 2 levels of meaningfulness, and 
2 levels of fittingness following a factorial design, Brand awareness was better 
for (a) highly meaningful and (b) fitting brand names. Meaningfulness influ- 
enced brand awareness when brand names were nonfitting, but not when they 
were fitting. Effects of cultural factors on brand awareness were noticed when 
brand recall for 3 product-utility categories was analyzed. 


Recently, Kanungo and Dutta (1966) re- 
ported that brand awareness is better if the 
brand name is highly meaningful than if it is 
of low meaningfulness, and that brand aware- 
ness of high-utility products is superior to that 
of low-utility products, In addition, they sug- 
gested that the technique of measuring mean- 
ingfulness (Kanungo & Panda, 1964a, 1964b; 
Noble, 1961) of verbal labels could be fruit- 
fully used by the advertisers for the purpose 
of brand naming. As an extension of this 
earlier study conducted in India (Kanungo 
& Dutta, 1966), the present investigation at- 
tempts to test the generality of the above 
findings using subjects (Ss) belonging to a dif- 
ferent cultural milieu. 

The present study also examines the effects 
of the fittingness or the appropriateness of the 
brand name for the product on brand aware- 
hess. Fittingness of a brand name seems to be 
an important characteristic that might influ- 
ence the strength of brand-product associa- 
tion. To the present author’s knowledge, no 
study has been reported on the effects of fit- 
tingness variable on brand awareness. The 
main obstacle to the systematic investigation 
of the effects of this variable has been the 


The study was supported partly by Grant No. 
X-12-179 from National Research Council of Can- 
ada and partly by Grant No. X-85-8 from Dalhousie 
University Research Development Fund. The author 
is indebted to Marcia Earhard for many helpful 
comments. 


lack of any operational definition and an ap- 
propriate measure of fittingness of brand 
names. Many advertisers often try to select 
or reject brand names on the basis of their 
fittingness to their product, but such selections 
are mostly based upon their intuition or sub- 
jective feelings due to the absence of any 
operational measure. The present study, there- 
fore, tries to offer an operational definition 
and a measure of the fittingness characteristic 
of the brand name, so that systematic study 
of the effect of the variable on brand aware- 
ness may be possible. 

It is proposed that a brand name may be 
considered fitting or appropriate for the prod- 
uct if it can readily evoke associations to the 
product in the minds of the Ss. Thus, accord- 
ing to the present author, a brand name is 
fitting for the product if it resembles (in form, 
sound, or meaning) a word that belongs to 
one of the most frequent associations to the 
product. For example, if the product lipstick 
evokes the word “color” as one of the most 
frequent associations, then the brand name 
“Kolory” (apparently a nonsense word, but 
having resemblance in form and sound to the 
word “color”) may be considered a fitting 
brand name for lipsticks. However, the same 
nonsense word will be a nonfitting brand name 
for the product smoking pipe if the word 
“color” is only a rare association to the prod- 
uct smoking pipe. This implies that associa 
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strength of the brand-product 
would be greater in the case of 
the case of nonfitting brand name. Thus with 
single or repeated exposures brand awareness 
would be better in the former than in the 
latter case. 


f 
; 


MeTHOD 
Selection oj Brand Names 


Consideration of the fitingaes, the 
ness, and the product-utility variable 
the present study influenced the selection of 
names, First, to manipulate the product 
able, there was a need to choose products 
fered in their utility for the 
different categories of products 
products used by males, by 


sexes. Four commonly used 


tl 
i 
| 
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tilt 


these categories were chosen, making a total of 12 
products, 
Second, in order to choose brand names that dif- 


fered in their fittingness for the 
ing procedure was employed. 


E 
i 
H 


names was printed on a separate sheet of paper and 
presented to a group of 19 male and 19 female col- 
lege undergraduate students, For each product, they 
were asked to give as many relevant associations as 


they could within a 30-sec, 
instructed not to give any 
their associative responses. Al 
by the group were found to be 
The most frequent (occurring 
rarest (occurring only once) 
uct were then determined from the responses 
by the group. From these response words, two 
and two nonfitting brand names for each 
were constructed. One fitting and one 
brand name were highly similar to a and 
2 rare response, respectively, and the other fitting 
and nonfitting brand names only partially resembled 
in sound or form a frequent and a rare response, Te- 
spectively. The reason for making half the brand 
names more similar to the associative responses than 
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More meaningful than the latter. 

Third, to make sure that half of the brand names 
had higher meaningfulness value than the other half, 
another 25 undergraduate students were presented 
with these 48 brand names. These Ss were asked to 
rate each brand name on a S-point meaningful- 
meaningless scale. value 
(n) for each of the brand names was calculated 
from these ratings. The products, their frequent and 
rare associates, and the brand names along with their 
m' are presented in Table 1. 
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Note.—Items 1-4 are female-use products, Items 5-8 are male-use products, and Items 9-12 are products used by, bots sexes + 
The bracketed word(s) next to each brand name is the associative response word(s) from which the brand name was derived. 
F = fitting; NF = nonfitting. 


purpose of asking Ss to rate the brand names was ble and to write them beside the name of the product 
twofold. First, through such ratings, Ss were given with which they were associated. 
incidental exposure to the advertisements, so that 
later test of brand awareness would be possible. RESULTS 
Second, Ss’ ratings would directly demonstrate if Ss R 
perceived the meaningfulness and appropriateness Ratings of Brand Names 
characteristic of the brand names in a manner in- . . iment 
tended by the experimental manipulations. The first part of S’s task in this expels 1595 
Each $ was given approximately 1-1.5 min. to Was to rate each brand name for its meani 
rate each brand name on the two scales. After S fulness and appropriateness. There were 4 
hag finished ating all nee 100 us the rooie brand names, half of which were HM 4 
e answer sheets ani e booklet were collected. ikewise 
The § was then given a recall sheet on which all the other half LM brand names. Li the rest 
the 12 products were listed in alphabetical order and Of the 48 brand names were F and 15 
was asked to recall as many brand names as possi- NF brand names (see Table 1). The adve 


TABLE 1 
Propucts, Source Worps, AND Branp NAMES WITH THEIR MEANINGFULNESS (m’) VALUE l 
High meaningful Low meaningful 
Product name 
Brand name m' Brand name m' 
1. Nylons 
F “Legs” (Leg) 4.36 “Leget” (Leg) 1.28 
NF “Feel” (Feel) 4.12 “Supet” (Superior) 1.20 : 
2. Girdle | 
F “Shapely” (Shape) 4.48 “Tyten” (Tight) 1.24 
NF “Stately” (Stately) 4.28 “Nyten” (Night) 1.28 
3. Lipstick 
F “Colory” (Color) 2.36 “Peenkis” (Pink & Kiss) 1.32 
NF “Shiney” (Shiny) 3.52 Cximist' (Cream) 1.40 
4. Eye shadow 
F “Blue-Lash” (Blue & Lashes) 3.36 “Loblu” (Blue) 1.36 
NF “Dark-Silk” (Dark & Silk) 3.88 “Lolid” (Lid) 1,28 
5. After-shave lotion 
F “Cooling” (Cool) 4.12 “Smelo” (Smell) 1.40 
NF “Healing” (Heal) 4,32 “Cozic” (Cozy) 1.20 
6. Smoking pipe 
F “Manly” (Man) 4.32 “Tobax” (Tobacco) 1.36 
NF “Clean” (Clean) 4.48 “Nicot” (Nicotine) 1.56 
7. Briefcase 
F “Executive” (Executive) 4.60 “Bizinet” (Business) 1.32 
NF “Important” (Important) 4.32 “Promets” (Promotion) 1.48 
8. Shirts (men’s) 
F “Tie-Match” (Tie) 2.48 “Dreso” (Dress) 1.40 
NF “Party-Fit” (Party) 2.52 “Mone” (Money) 1.36 
9. Writing pads 
F “Letters” (Letter) 4.32 “Leterit” (Letter) 1.52 
NF “Economy” (Economy) 4.44 “Thicrap” (Thick) 1.24 
10. Pen 
F “Writer” (Write) 4.36 “Incko” (Ink) 1.32 
NF “Family” (Family) 4.68 “Metto” (Met) 1.40 
11. Adhesive tape 
F “Sticky” (Sticky) 4.24 “Bandes” (Bandaid) 1.48 
NF “Sporty” (Sport) 4.20 “Landes” (Land) 1.40 
12. Folders 
F “Paper-Safe” (Paper) 2.88 “Filex” (File) 1.32 . 
NF “Camp-Sign” (Camp & Sign) 2.52 “Firex” (Fire) 1,40 
Í 


Brand AWARENESS 


tisement booklets were distributed in such a 
manner that each of the 48 brand names was 
rated by 12 Ss (6 male and 6 female Ss) on 
6-point meaningfulness and appropriateness 
scales. The 6 points in each scale were as- 
signed ordinal weights of 5 (very high) to 0 
(none) for the purpose of deriving the mean 
meaningfulness and appropriateness value for 
each brand name. The S’s ratings indicated 
that the 24 HM brand names have signifi- 
cantly higher meaningfulness values (mean 
m' = 2.74) than those of the 24 LM brand 
names (mean m’ = 1.89, t = 3.40, p < .01). 
The appropriateness ratings also indicate that 
the F brand names were considered more 
appropriate (mean appropriateness value = 
3.07) than the NF brand names (mean ap- 
propriateness value = 1.72, 1 4.65, p< 
.01). This suggests that Ss did perceive the 
HM and LM brand names selected by the 
experimenter as having high and low mean- 
ingfulness, respectively. Likewise, they also 
perceived F and NF brand names, respec- 
tively, as more and less appropriate for the 
products. 


Effects of Fittingness and Meaningjulness 


In order to determine the effects of fitting- 
ness and meaningfulness variables on brand 
awareness, a 2 X 2 X 2 analysis of variance 
(Lindquist, 1953, Type VI design) was per- 


TABLE 2 
ANALYSIS OF VARIANCE OF RECALL SCORES 


Source of variation | df MS F 
Between Ss 47 
S's sex (S) 1 15 <i 
Error (b) 23 2.65 
Within Ss 144 2 5 — 
Meaningfulness (M) 1 i . 
Fittingness (F) 1 24.08 66.89* 
MXF d 6.07 10.84* 
MXS 1 96 1.30 
FXS 1 27 <1 
MXFXS 1 35 <i 
Error (w) 138 55 
Errori (w) 40 74 
Errors (w) 46 36 
Errors (w) 46 56 
Total 191 

*p <.01. 


TABLE 3 


Max Recatt Scourt roe N awo 
MANL Beaxo Nags 
— 
| Fitting | Non&itting i 


Highmeaningful | 281 | 246 2 
Low meaningful | 236 10 =| Taree 
f 1.67 | 640% | 


Nete.—-Means are based oa the recall of all 48 . 
$p < 0s, 


formed on the recall of brand names. The 
three classifications were: high and low mean- 
ingful brand names, fitting and nonfitting 
brand names, and male and female Ss. The 
two main effects of meaningfulness and fitting- 
ness and their interaction were highly sig- 
nificant (Fs = 23.67, 66.89, and 10.84, re- 
spectively). These results are presented in 
Table 2. 

The mean recall of HM-F, LM-F, HM- 
NF, and LM-NF brand names by all 48 Ss 
presented in Table 3 suggests that brand 
awareness is better for fitting than for non- 
fitting brand names and for high meaningful 
than for low meaningful brand names. Fur- 
thermore, these data suggest that meaningful- 
ness variable does not influence brand aware- 
ness when the brand names are fitting ones 
(the difference between HM-F and LM-F 
recall yields a nonsignificant ¢ = 1.67). How- 
ever, when the brand names are of nonfitting 
type, the higher the meaningfulness the better 
is the brand awareness (the difference between 
HM-NF and LM-NF yields a ¢ = 6.40, p < 
01). This is the reason for the significant 
interaction between meaningfulness and fit- 
tingness. The male and female Ss did not dif- 
fer with respect to their recall of brand names 
(see Table 2). However, within each group 
the HM brand names were recalled signifi- 
cantly better than the LM brand names. For 
male Ss the recall means of HM and LM 
brand names were 5.27 and 3.87, respectively 
(t= 6.67, p < .01), and for the female Ss 
the corresponding means were 5.16 and 4.25, 
respectively ( = 4.33, P< 01). Similarly 
for both male and female Ss taken separately 
F brand names were recalled better than NF 
brand names. For male Ss the recall means 
of F and NF brand names were 5.21 and 3.83, 
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respectively (f =6.57, p < .01), and for fe- 
male Ss the corresponding means were 5.33 
and 4.08, respectively (t = 5.95, p < .01). 


Effect of Utility of the Product 


A 2 X 3 analysis of variance was performed 
on the recall scores of male and female Ss 
for the brand names of the three categories: 
male-use, female-use, and used-by-both prod- 
ucts. Out of the two main effects, the male 
and female Ss, and three product categories, 
only the latter approached significance (F = 
2.74, df = 2/92, 10> p> .05). The mean 
recall scores of the male Ss were 3.29, 3.12, 
and 2.87, and the corresponding recall scores 
of the female Ss were 3.17, 3.25, and 3.00 for 
male-use, female-use, and used-by-both prod- 
ucts, respectively. These results indicate that 
male and female Ss did not differ in their re- 
call patterns. The brand names for male-use 
and female-use products were recalled equally 
well by both the groups. There was a trend 
in both the groups to recall fewer brand names 
for the used-by-both products than those for 
the other two categories. 


Discussion 


The above results clearly confirm the earlier 
findings that brand names having higher 
meaningfulness values are retained better than 
those having low meaningfulness value (Ka- 
nungo & Dutta, 1966). In addition, the re- 
sults lend strong support to the hypothesis 
that a fitting brand name derived from the 
association norms of the product is retained 
better than a nonfitting brand name. The fit- 
tingness variable seems to influence brand 
recall not only when brand names are low 
meaningful ( = 7.07, p < .01) but also when 
they are high meaningful (f = 2.33, p < .05). 
The meaningfulness variable, however, influ- 
ences brand recall only when brand names 
are nonfitting (t= 6.40, p<.01) but not 
when they are fitting (= 1.67, p> .05). 
Such interaction effects may interest the ad- 
vertisers who want to choose between two 
HM or LM brand names. However, further 
replications are needed before the advertiser 
can place his confidence in such interaction 
effects. 

In practice, many advertisers choose brand 
names that they consider intuitively as fitting 
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to their products. Very often, such intuitive 
decisions are based upon one or a few indj- 
viduals’ remote associative responses to the 
products. As suggested in this study, a better 
method of choosing fitting brand names would 
be to use the conventional laboratory word- 
association techniques (Cofer, 1958) with a 
sample of potential consumers. Generally, the 
association techniques have been used for the 
purpose of studying language and verbal learn- 
ing processes (Deese, 1959; Noble, 1952) and 
for clinical diagnoses (Jung, 1910; Kent & 
Rosanoff, 1910). The present study extends 
its usefulness to the selection of brand names 
by advertisers. 

The pattern of recall for brand names of the 
three product-utility categories does not con- 
form to the pattern of results reported by 
Kanungo and Dutta (1966). The authors re- 
ported that their male Ss recalled fewer brand 
names of female-use products and their fe- 
male Ss recalled fewer brand names of male- 
use products than what was expected of them. 
These findings supported their hypothesis that 
brand awareness of high-utility products is 
superior to that of low-utility products. In the 
present study, however, both male and fe- 
male Ss recall male-use and female-use prod- 
ucts equally well. These differences in the 
results of the two studies are presumably due 
to the difference in the culture patterns of 
Ss. The Ss used by Kanungo and Dutta 
(1966) were drawn from three Indian educa- 
tional institutions where males and females 
live in a more or less segregated fashion. The 
cultural standards of India impose greater 
restrictions on free mixing among young un- 
married male and female students compared 
to the North American cultural standards. 
Thus, for example, a female student in India 
would have less knowledge and concern about 
male-use products such as cigarettes or shav- 
ing blades. Likewise a male student in India 
would be less exposed to and interested in 
female-use products such as perfume or nal 
polish. On the other hand, the male and fe- 
male Ss used in the present study share each 
others interests to a greater extent due t0 
greater exposure to each others interests in 
this culture. Thus it is not surprising that 
both male and female Ss in the present study 
may be showing equal interest in both male- 


awareness, 
plored more fully in the future. 
Finally, it may be recalled that a trend was 
noticed in both the male and female Ss to 
recall fewer brand names for the 


male and female Ss compared to the products 
in the other two categories, such as after 
shave lotion, nylons, or girdles, then the trend 
shown by the results are consistent with the 
hypothesis that brand awareness of high-util- 
ity products is better than that of low-utility 
products. The four products belonging to the 
used-by-both category may have been con- 
sidered somewhat less important and therefore 
of lower utility than the products belonging 
to the other two categories. Unfortunately the 
products were not rated on a utility scale by 
the Ss. Such ratings might have substantiated 
the above explanation. 
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AGE, CIGARETTE SMOKING, AND TESTS OF PHYSICAL FITNESS 


KENNETH H. DAVID + 


University of Hawaii 


It was hypothesized that smoking would decrease Ss’ performance on physical 
fitness tests calling for strenuous activity, but not for tests calling for mild 
activity. 88 Ss were given 5 physical fitness tests as part of their work require- 
ment and then asked to report the number of packages of cigarettes smoked 
per day. A multiple linear regression analysis of the results, using age as a co- 
variate and amount of smoking as the independent variable, supported the 
hypothesis. The smoking conditions resulted in significant differences for the 
most strenuous test (1-mi, Run) and the 2 moderately strenuous tests (Crawl- 
ing and Dodge & Jump), with performance scores decreasing as reported 
smoking scores increased. The least physically demanding tests showed no sig- 
nificant differences between smoking conditions. Age was an important factor 
in accounting for Ss’ performance on all of the tests. 


The present study was undertaken in order 
to investigate the relationship between vary- 
ing amounts of cigarette smoking and several 
tests of physical fitness. If cigarette smoking 
affects an individual’s performance on tests 
of physical fitness, the effect is probably 
mediated by the decreased efficiency of the 
circulatory and respiratory systems. It would 
appear probable that smoking reduces the 
oxygen capacity of the lungs; therefore, dur- 
ing physical activity the muscles may be de- 
prived of sufficient oxygen, leading to feelings 
of fatigue and lowered performance. More 
strenuous activity would be expected to place 
a greater demand upon the circulo-respiratory 
system for oxygen than would mild activity. 
Thus, if smoking decreases the efficiency of 
a person’s circulo-respiratory system, one 
would expect that smoking would decrease 
one’s performance on tasks in which the 
circulo-respiratory system is of major im- 
portance, that is, tests in which vigorous 
physical activity is required. Conversely, one 
would expect that smoking would not affect 
one’s performance on tests in which the 
circulo-respiratory system plays a minor role, 
that is, tests in which there is minimal physi- 
cal activity. 

In a study of the relationship between 
smoking and physical fitness, it was hypoth- 
esized that smoking would decrease Ss’ per- 
formance on tests calling for strenuous ex- 
ercise, and that greater smoking would result 
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in a greater decrease in performance. For fit- 
ness tests calling for mild physical activity, 
smoking was expected to have a negligible 
effect upon Ss’ performance. Thus, for the 
One-Mile Run test, which was the most stren- 
uous test, smoking was expected to result in 
a greater decrease in performance than for 
any of the other tests. The Dodge & Jump 
test and the Crawling test were considered to 
be moderately strenuous, thus smoking was 
expected to result in decreased performance, 
but to a lesser degree than for the One-Mile 
Run test. The Climbing test was considered 
to be a mildly active test, thus smoking was 
expected to result in a slight or no decrease 
in performance. The Throwing test called for 
a minimal amount of activity, thus smoking 
was not expected to decrease performance. 


METHOD 


Subjects. The Ss were 88 military personnel rang- 
ing in age from 19 to 39 yr. The physical fitness 
tests were administered to Ss as part of their normal 
work requirement. During the 6 wk. preceding the 
testing, Ss performed moderate calisthenics for 2 br. 
per week. The number of calisthenics was the same 
for all Ss, except that the older Ss (30-39 yr.) ee 
formed 10% fewer exercises than the younger °° 
(19-29 yr.). 

Tests. The five physical fitness tests were: 
Crawling test—S was required to crawl for a dis 
of 120 ft., sliding along the ground at all times. 
S's score was the amount of time required to on 
the distance. (B) Climbing test—S was required to 
swing from rung to rung on a horizontal E 
without touching the ground. The S’s score Was | 
number of rungs traversed during a 1-min. Pal 
(C) Dodge & Jump test—S had to run across 


(A) 
tance 
The 


296 


Ace, CIGARETTE Smoxinc, ano Frrxess 


TABLE 1 


Fitness test M SD 
Crawling | 31.59 sec | 6.28 
Climbing | 45.18 rung 13.40 
Dodge & Jump | 25.83 sec. | 1.87 
Throwing 23.93 pt. 8.07 
One-Mile Run 519.41 sec. 86,01 

Age 25.83 yr 5.51 


49-yd. obstacle course and return to the starting 
position. The S had to avoid the obstacles and jump 
over a ditch during the running. The S's score was 
the amount of time taken to complete the course. 
(D) Throwing test had to throw five grenades 
from a kneeling position, in which he attempted to 
hit the center of a target area 90 fl. away. The S’s 
score was based upon the combined scores of the five 
throws. Performance on this test is primarily a func- 
tion of psychomotor skills, rather than demanding 
physical endurance. (E) One-Mile Run test was 
required to run around a 1-mi, grass track. The S's 
score was the amount of time taken to run the dis- 
tance. This task is a physically demanding test, in 
which physical endurance is of primary importance. 

Procedure. The Ss were separated into four groups 
of 22 Ss each, assigned according to the alphabetical 
order of their last names. All four groups were given 
the Crawling test first and the One-Mile test last, 
with the other three tests given in a counterbalanced 
order. The order of the tests for the four groups 
was: ABCDE, ACBDE, ACDBE, and ADCBE. The 
Ss were instructed to complete the four timed tests 
as quickly as possible and to strive for accuracy in 
the Throwing test. 

After completing the tests, Ss were required to 
write down the number of packages of cigarettes 
smoked per day, according to one of six choices: 
(a) none, (b) less than 4 per day, (c) 3 to less than 
1, (d) 1 to less than 14, (e) 14 to less than 2, and 
(f) 2 or more. The number of Ss falling within 
each category were: a= 19, b=14, C= 13, d 14, 
e= 14, and f 14. 


RESULTS AND Discussion 


The age of the Ss appeared to be correlated 
with their performance scores; therefore, 
Pearson coefficients between age and perform- 
ance were computed for the five physical fit- 
ness tests. The coefficients between age and 
the Crawling, Climbing, Dodge & Jump, 
Throwing, and the One-Mile Run tests were 
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772, —.556, 623, —.688, and 598 (p < 01, 
df = 86), respectively. 

The results were analyzed by means of a 
multiple-regression analysis (Bottenberg & 
Ward, 1963). A test for an interaction effect 
between age and the six smoking conditions 
was made for each of the physical fitness 
tests, with no significant differences occurring 
(p> .05). Since age was significantly cor- 
related with the fitness tests, and age and 
smoking showed no significant interactions, 
age was used as a covariate with amount of 
smoking as the independent variable. The 
analysis showed that there were significant 
differences between the smoking conditions 
for three of the five physical fitness tests (see 
Table 1). The smoking variable showed sig- 
nificant differences for the Crawling test (F 
= 3.07, dj = 5/81, p < .02), the Dodge & 
Jump test (F = 6.56, dj = 5/81, p < .001), 
and the One-Mile Run test (F = 13.09, df = 
5/81, p<.001), with no significant differ- 
ences found for the Climbing test and Throw- 
ing test (p > .05). 

In order to determine which of the smoking 
conditions were significantly different from 
each other, an analysis was made between 
each of the smoking conditions for the Crawl- 
ing, Dodge & Jump, and One-Mile Run tests; 
thus, 45 comparisons were made using age as 
a covariate. A comparison of the six smoking 
conditions for the Crawling test showed that 
11 of the 15 comparisons gave significant dif- 

2 Appreciation is expressed to Frank Ahern for the 
analysis and to the Computing Center of the Univer- 
sity of Hawaii. 
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ferences (p < 01) in the predicted direction, 
while the other four comparisons showed no 
significant differences (p > .05). The Dodge 
& Jump test showed 13 significant differences 
between the smoking conditions, with the two 
nonsignificant differences occurring between 
conditions 4-1 and 1-14 packages (p > .05) 
and 1-14 and 13-2 packages (p > .05). The 
significant differences were in the predicted 
direction, with greater smoking resulting in 
greater decrease in performance. The One- 
Mile Run test showed significant differences 
between all 15 comparisons of the smoking 
conditions (p < .01 for 14 of the comparisons, 
and p< .02 for the other). Thus, for the 
Crawling, Dodge & Jump, and One-Mile Run 
tests, an increase in self-reported smoking was 
associated with a decrease in performance. 
The analysis of the results supported the 
hypotheses. The smoking conditions had a 
highly significant effect upon the most strenu- 
ous of the physical fitness tests, the One-Mile 
Run test, with smoking accounting for ap- 
proximately 29% of the variance. Smoking 
also showed significant effects for the two 
moderately strenuous tests, with the Dodge & 
Jump and Crawling tests accounting for ap- 
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proximately 18% and 6% of the variance, re- 
spectively. Smoking showed no significant ef- 
fects for the two least physically demanding 
tests, the Climbing and Throwing tests. There- 
fore, the results support the position that 
smoking results in decreased performance of 
physical fitness tests which require strenuous 
activity, and with a decrease in the required 
endurance of the test there is a corresponding 
decrease in the effect of smoking. 

The age factor was of greater importance 
than smoking in accounting for the differences 
between Ss’ scores on the physical fitness tests 
(see Table 1). For the Crawling, Climbing, 
Dodge & Jump, Throwing, and One-Mile 
Run tests, the age factor accounted for ap- 
proximately 60%, 31%, 39%, 47%, and 36% 
of the variance (p < .001), respectively. 
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ABILITY, VALUES, AND CONCEPTS OF EQUITABLE SALARY 
INCREASES IN EXERCISE COMPENSATION 


BERNARD M. BASS 
Management Research Center, University of Rochester 


113 American graduate business students recommended annual salary increases 
for 10 fictitious engineers, cach with a different reason for being given a rabe. 
The mean recommendation was 9.3%, Recommenders were more generous if the 
students were of lower intelligence and achievement, and if they held strong 
social and service rather than theoretical and economic values. They sbo were 
more willing to spend company money for other social and personnel perposes. 


Several recent reviews point to increasing 
interest in psychological aspects of pay 
(Adams & Jacobsen, 1964; Haire, Ghiselli, & 
Porter, 1963; Weick, 1966). A modest amount 
of available research has probed the effects 
of pay differentials on attitudes and perform- 
ance of those receiving the money, but little 
has been done to ascertain what motivates 
those who decide on what salaries will be paid 
when they respond to requests for salary in- 
creases and how their own ability and values 
affect their monetary decisions. These latter 
issues are the concern of this report. 


METHOD 
Subjects 


Forty-four percent of the 113 male graduate busi- 
ness students who participated in this study held 
undergraduate engineering degrees, the other 56% 
had earned undergraduate degrees in the liberal arts 
or in business administration. They averaged close 
to “B” in undergraduate grades and earned a mean 
of 566 on the Admissions Test for Graduate Schools 
of Business (ATGSB). 


Procedure 


During the fourth week of a course in behavioral 
science, each participant was involved in a series of 
small group exercises (Bass, 1965, 1967). One of 
these, Exercise Compensation First ed.), required 
that each participant, by himself, make a set of 10 
salary recommendations prior to a group discussion 
about the matter. He was given the instructions 
which in essence were as follows: 


You will have to recommend increases for 10 
engineers in your company, all of whom are un- 
married, 22-year-old college graduates who have 
just completed their first year with the company 
and are now to be considered for an annual raise. 
Keep in mind that you may be setting precedents, 
that you need to keep down salary costs, that you 
need to consider equity, that you want to retain 


all the engineers if pomible. Your company has no 

job evaluation schedule but tries to match com- 

munity and industry norms and to be competitive. 
The 10 engineers are as follows: 


Al, an average performer (SOth percentile) ; 
Bill (90th percentile in merit) ; 
Charlie (10th percentile in merit). 
All the remainder are like Al in merit, but in 
addition: 
Dan is working on an insecure government con- 
tract. Because of his specialty, transfer is dif- 


ficult. 
Ed is on a “dead-end” job lacking advancement 


potential. 

Frank has a dirty, unpleasant, hazardous, uncom- 
fortable job. 

Garry has a low-prestige job in a low-prestige 


subsidiary. 

Henry has to work closely with a crew of uncon- 
genial, unfriendly, competitive co- 

Irwin has a boring, dull, and monotonous job. 

Jim has a definite job offer from a competitor sub- 
stantially better than his current salary. 


Other Measures 


Of the 113 participants, during other weeks of the 
course, 112 completed the Allport-Vernon Study of 
Values and ranked their own life goals on the first 
edition of Exercise Life Goals (Bass, 1965). The 
goals were defined as follows: 


(a) Leadership: To become a community leader, to 
become influential in public affairs, to become in- 
fluential among co-workers. 

(b) Comfort: To have the time and means to en- 
joy life, to become happy and content, to have 
financial security, to be well-liked. 

(c) Creativity: To produce original and unique 
plans and ideas, to devise new approaches, to be 


expressive. 

(d) Expertness: To become an authority on a spe- 
cial subject in any field, to persevere to reach a 
hoped-for expert level of skill and accomplishment. 

(e) Technology: To make contributions to tech- 
nical or scientific knowledge, to invent apparatus or 


equipment. 
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TABLE 1 


RELATION BETWEEN TESTED INTELLIGENCE ACHIEVEMENT AND COMPENSATION DECISIONS 


'. Mean % = 
- ATOSE Undergraduate | increase in SD of 
Recommendations for grades? ERREN E recommen- 
Ls Q Ta dations? dations* 
Al (average) —.08 | —.18* | —.17* —.19* 8.02 | 2.62 
Bill (best) 08 14 BG) PS yi 11.96 3.78 
Charlie (poor) r teat Wa —.23* .00 5.55 | 2.38 
Dan (no security) —.09 —11 —.15 —.03 9.83 3.24 
Ed (no opportunity) — 42 —.21* —.22* —.09 9.05 
Frank (poor conditions) —.01 —.24** —.16* —.04 10.18 
Garry (no prestige) —.10 —.21* —.20* —.14 9.49 
Henry (unfriendly co-workers) | —.05 —.21* —.18* —.13 9.72 
Irwin (uninteresting work) 00 —.19* —.12 —.14 
Jim (competitive offer) —.02 —.16* —.11 —.08 


Note.—ATGSB = Admissions Test for Graduate Schools of Business. L = Linguistic; Q 
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(f) Prestige: To become well-known, to obtain 
awards and recognition. 

(g) Service: To contribute to the satisfaction of 
others, to be helpful to others who are in difficulty. 

(h) Wealth: To earn a great deal of money, to 
build a large financial estate. 

(i) Independence: To be free to do what one 
likes, to be one’s own boss, to control others rather 
than be controlled by them. 


Exercise Objectives (Bass, 1965) also was com- 
pleted by the 113 participants. In this activity, after 
reading a company’s last annual profit and loss state- 
ment, each participant made five budgeting decisions. 
In one of these he had the option to spend or not to 
spend $225,000 to correct a safety hazard likely to 
produce a serious injury. In another, he could spend 
or avoid spending $150,000 to modify a plant that 
was polluting a stream. In still others, he had to 
spend money on managers, products, and to settle 
a strike, but in these cases his problem was on which 
way to spend the money to handle each case. These 
latter decisions were not expected to relate to salary 
recommendations and in fact they did not do so. 


RESULTS 
Mean Recommendations 


The mean increase awarded the 10 engi- 
neers by the 113 graduate business students 
(Table 1, column 5) was 9.3%. As is common 
with most other groups of managers and stu- 
dents tested here and abroad, the largest 
source of variance in recommendations was 
merit. Bill, of high merit, was assigned 1.5 
times the increase awarded Al, the average 


was given approximately .7 as much as aver- 
age Al. The hypothesis has been advanced 
(Bass, 1967) that half of the increase given 
Al, the average man, will be denied to Charlie. 
This half will be added to Bill’s increase so 
that Bill gets 1.5 times as much as Al, and 
Charlie is left with only .5 of the increase 
given Al, But the hypothesis did not hold 
completely in Charlie’s case. Here, the ratio 
of mean recommendations that occurred for 
Bill: Al: Charlie was 3:2:1.5, 
Seas 

As seen in Table 1, column 5, next to 
meritorious performance the participants were 
most willing to pay for poor working condi- 
tions (Frank) and were relatively least re- 
sponsive to Ed who lacked opportunities, al- 
though in comparison to what they awarded 
average Al, they were willing to give 4 
premium for everyone from Dan to Jim who 
had special problems. 


rather than 


Correlational Analyses 


The main purpose of this report is to see 
what personal factors influenced the recom- 
mendations. For this, correlational analyses 
will be presented in four sections. First, the 
extent that salary recommendations are as- 
sociated with the intelligence and achievement 


3.00 

3.08 

3.16 

3.10 

9.83 3.26 

9.34 3.32 

iness. L = Linguistic; Q = Quantitative; T = Total. 

man, while Charlie, the man of low merit, 
b 


Sala Incegases 


of the recommender will be examined. Next, 
will be seen how the self-reported values of 
the recommender may enter the picture. 
Third, a look will be taken at how the recom- 
mender’s own stated life goals correlated with 
his recommendations for raises in salary. 
Finally, the author will note how generosity 
in recommending raises also may relate to 
willingness to spend company money on other 
personnel and organizational needs. 


Ability and Compensation Decisions 


Table 1 shows the product-moment cor- 
relations between the salary increases each 
participant was willing to award each of the 
10 engineers and each participant’s tested in- 
telligence and undergraduate record (columns 
1-4). It can be seen that differences among 
the participants’ decisions were unrelated to 
verbal aptitude as measured by the ATGSB, 
except possibly the salary decisions about 
Charlie, who was lacking in merit. However, 
a modest association was found between 
quantitative aptitude and the tendency to 
recommend smaller raises in salary. Partici- 
pants with greater tested quantitative apti- 
tude awarded smaller salary increases, par- 
ticularly (p < .01) when the increases were 
being justified by poor working conditions 
(r = —.24). When verbal and quantitative 
scores were combined and total intelligence 
test scores were correlated with compensation 
decisions, the effect was most pronounced in 


ing was the significant correlation 


mended smaller salary increases for the most 
meritorious engineer. They awarded Al, the 
average engineer, a smaller raise also. There 
was little other evidence of the influence of 
undergraduate achievement on the other com- 
pensation decisions. 

To sum up, some tendency was seen for 
brighter and more academically successful 
participants to recommend smaller raises (or 
for the duller and less successful to recom- 
mend bigger salary increases). Quantitative 
aptitude was more of a factor than verbal 
aptitude. 


Values, Liſe Goals, and Compensation Decision 


The correlations among the Allport-Vernon 
Study of Values scale scores and the salary 
recommendations are shown in Table 2. It can 
be seen that holding certain values predis- 
posed recommenders to be generous, while 
strong values in other directions did the re- 
verse. Those with strong social values tended 
to recommend bigger raises in general to all 
engineers, but the effects were less marked 


TABLE 2 
ATION BETWEEN ALLPORT-VERNON Srupy or VALVES AND COMPENSATION Decisions 


— — —— 


Recommendations for 
Al (average) —:23** = 38° 
Bill (best) 3 —18* 
Charlie (poor) —.20˙ oy . 
Dan (no security) . aa 
Ed (no opportunity) -237 e 
Frank (poor conditions) ee * 
Garry (no prestige) -290 men 
Henry (unfriendly co-workers) | — 2475 aes 
Trwin (uninteresting work) cay la 


Allport-Vernon scores 


10 — 03 16° 
=.09 07 11 
08 —.06 17* 
—.10 Al 10 
— 01 04 14 
— 05 0⁰ 19˙ 
03 04 16" 
04 08 12 
05 09 17 
02 07 17 


! ĩ T 


Note. N = 112. 
*p S. 05. 
ED S. 01. 
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TABLE 3 
RELATION BETWEEN SELF-RANKED LIFE GOALS AND COMPENSATION DECISIONS 
Self-ranked life goals 
Recommendations for . Inde 
Toader ii Cap: Crean aj Expert mer Prestige | Service | Wealth | pend- 
ship fort tivity ness nology | 
| ence 
Al (average) | 10 —.07 —.11 —.10 —.09 02 2 —.03 01 
Bill (best) „„ 09 22% —.06 | —.10 
Charlie (poor) 00 —.11 | —.09 —.00 —.12 —.00 .18* —.03 02 
Dan (no security) 10 08 14 Ala 19* 06 27 —.00 04 
Ed (no opportunity) AZ .09 1 07 19 * .01 28 02 02 
Frank (poor conditions) .12 07 11 07 13 —.00 oe —.04 —.04 
Garry (no prestige) 11 —.12 | —.10 —.13 —.16* .03 27 .04 —.01 
Henry (unfriendly 09 00 .06 12 .18* 03 127 02 | 06 
co-workers) | 
Irwin (uninteresting 07 —.05 | —.06 —.14 —.20* 02 2 —.00 04 
work) | 
Jim (competitive offer) “ton —.03 —.15* | —.16* | —.21* 03 27 02 02 


Note.—Signs reversed, N = 113. 
* <.05. 
* <.01. 


when dealing with Bill (most meritorious), 
Frank (poor working conditions), and Jim 
(competitive offer). Evidently, lack of se- 
curity, opportunity, prestige, congenial co- 
workers, and interesting work weighed heavier 
as an excuse to grant bigger raises for recom- 
menders scoring high in social values. 

The correlations also were uniformly posi- 
tive between religious value scores and the 
size of salary recommendations, but they 


TABLE 4 


RELATION BETWEEN WILLINGNESS TO SPEND MONEY 
ON SAFETY AND STREAM CLEANING AND 
CoMPENSATION DECISIONS 


Spend on 
Engineer Spend on 1 
safety 
clean-up 
Al (average) 07 009 
Bill (best) 10 14 
Charlie (poor) 00 alii 
Dan (no security) 235 AL 
Ed (no opportunity) .20* 13 
Frank (poor conditions) 18* 11 
Garry (no prestige) Abe 11 
Henry (unfriendly co-workers) 21 13 
Irwin (uninteresting work) .19* 16* 
Jim (competitive offer) 2 13 


Note.—N = 113, 
*p <.05. 
* p < 01, 


were generally lower. Strong social values 
played a greater role than strong religious 
values. At the other extreme, recommenders 
holding strong economic and theoretical values 
according to their self-reports were likely to 
offer smaller raises to the engineers. Those 
holding strong economic values were particu- 
larly unlikely to be generous to Al (average 
merit) and Charlie (low merit). Those hold- 
ing strong theoretical values were particu- 
larly unlikely to be generous to George (no 
prestige) and Irwin (unfriendly co-workers). 

Table 3 shows how self-ranked life goals 
were related to compensation recommenda- 
tions. Here one goal stands out. Recom- 
menders with relatively greater concern for 
service as an important life goal tended to 
be more generous—almost uniformly 50. 

To a lesser extent, those concerned about 
technical success were likely to be less genet 
ous than the average recommender. 

Different reasons for being generous may 
or may not be involved. For those high in 
economic value, it may be that the same 
amount of money is worth more and is con- 
sidered to be a larger raise than would be 
seen by those low in economic value. For 
those high in concern for theory, understand- 
ing, and technical success, less need may. 
seen to grant large increases in compensation 
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for effort. For those with strong concerns for 
social service and religion, pure generosity 
may be operative in their salary recommenda- 
tions. 


Willingness to Spend Money 


How general is generosity? Two of the 
five budgeting decisions on Exercise Objec- 
tives involved whether or not to spend money 
(not how to spend it) primarily for human or 
social rather than immediate economic re- 
turns. As can be seen in Table 4, there was a 
slight positive association between the size of 
salary increases recommended and a willing- 
ness to spend money on fixing a plant to end 
pollution of a stream. There seemed to be an 
even greater tendency to provide extra money 
for raises for all kinds of special reasons other 
than merit (r’s ranged from .18 to .23) and 
a willingness to spend money on safety equip- 
ment to avoid the possibilities of a serious 
accident, 


CONCLUSIONS 


Despite the observation that “Pay, in one 
form or another, is certainly one of the main- 
springs of motivation in our society” (Haire 
et al., 1963), little data are available on what 
bases decisions about pay are made. Many 
people would like to believe that highly ob- 
jective analyses produce highly objective job 
evaluations and merit rating and manpower 
plans which yield highly objective standards 
for establishing pay and pay raises. In fact, 
the author suggests that much subjectivity of 


factors in consideration by the recommenders 

of salary increases, at the same time, as might 

have been anticipated, the abilities and values 
of the recommenders played some role in 
what values they were willing to recommend. 

Higher salary increases were recommended 
by those lower in intelligence and achieve- 
ment, by those who held strong social and 
religious rather than theoretical or economic 
values, who were more interested in service 
rather than technical achievement and who 
also were more willing to spend their com- 
pany’s money for other personnel and social 
purposes, 
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RELATIONSHIP BETWEEN PERCEPTUAL STYLE 
AND SIMULATOR SICKNESS 


GERALD V. BARRETT anp CARL L. THORNTON 


Goodyear Aerospace Corporation, Akron, Ohio 


Simulator sickness was hypothesized to be caused by the conflict between the 
visual presentation of apparent motion and the lack of any corresponding body 
sensation of motion. The hypothesis was tested by correlating individual differ- 
ences in scores on the Rod and Frame Test (RFT; which measures accuracy of 
adjustment of a rod to true vertical under conditions of visual-kinesthetic con- 
flict) and degree of simulator sickness. The data for Series 3 of the RFT and 
the indexes of sickness were best represented by hyperbolic functions yielding 
correlations of .40-.52, Implications for simulation technology and for a general 
conflict of cue theory are discussed with emphasis on supporting evidence 


from several areas of investigation. 


Since World War II, simulators have been 
developed which give the visual illusion of 
motion without any actual physical motion. 
Unfortunately, certain Ss became ill while 
operating these devices. The illness phe- 
nomenon was first intensively studied by Mil- 
ler and Goodson (1958, 1960), who labeled it 
motion sickness because the symptoms re- 
sembled those experienced by some people in 
moving vehicles. There was, however, no rea- 
son to prematurely so label this phenomenon 
motion sickness since motion is not involved 
in fixed-base simulators. As Tyler and Bard 
(1949) have pointed out, the primary cause 
of motion sickness is probably motion, and 
the failure to appreciate this fact can lead to 
confusion in conceptualization. In this paper 
the term “simulator sickness,” rather than 
motion sickness, will be used to denote the 
symptoms which occur in fixed-base simu- 
lators incorporating a moving visual scene. 

A number of hypotheses have been in- 
formally advanced to explain simulator sick- 
ness: distortion of vertical objects, rapid 
change in brightness, too much detail, poor 
resolution, excessive lag between simulator 
controls and corresponding shift in visual dis- 
play, high-frequency vibrations which disrupt 
accommodation, distance between the visual 
display and the observer such that accom- 
modation is different from that usually ex- 
perienced, and conflict between the apparent 
motion seen on the visual display and lack of 
any corresponding motion of the simulator. 


This last hypothesis was investigated in the 
present study. 

Several other conflict situations have been 
found to produce sickness. Wood (1895) de- 
scribed an amusement park device with a large 
immobile swing inside a movable room. When 
the room moved many individuals experienced 
considerable discomfort. Crampton and Young 
(1953) induced nausea in Ss seated in a fixed 
chair in the center of a rotating room. Dis- 
comfort lasted for some time after the ex- 
perience, for one S up to 2 days. This wide 
range of sickness time (from 0 to 48 hr.) is 
noteworthy, since all Ss were subjected to the 
same experience. 

While conflict between visual and body 
cues may be the dominant cause of simulator 
sickness, other factors such as type of simu- 
lator, fidelity of simulation, S's experience, 
and S’s involvement are also important. Fixed- 
base simulators may have two types of visual 
displays: outside-in (O-I), such as a child's 
remote control car, or inside-out (I-O), in 
which the operator views the scene as he 
would from inside a real vehicle. While sick- 
ness has been quite common with an I. 
display, none has been reported for O-I simu- 
lators. Low involvement resulting from being 
unable to put oneself psychologically into the 
vehicle is probably the reason that O-I dis- 
plays do not cause sickness. A comparison of 
I-O and O-I displays (Matheny, Dougherty, 
& Willis, 1963) revealed that performance 
improved with an I-O display (but not 0-1) 
when motion cues were added. Since the op- 
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erator “expects” motion in an I-O situation, 
cue conflict which leads to sickness and/or 
decrement in performance can occur when 
such expected motion is not present. 

Concerning S’s experience, Miller and Good- 
son (1958) found that 60% of the experienced 
helicopter pilots but only 12% of the student 
pilots tested reported simulator sickness. 
There was probably no cue conflict for the 
student pilots since they had not learned the 
particular body inputs which occur during 
helicopter operation. Fitts (1951) suggested 
that visual control was important when an 
individual first learns a perceptual-motor 
task, and proprioceptive feedback or “feel” 
becomes more important as experience in- 
creases. Fleishman and Rich (1963), for ex- 
ample, using predictors of a perceptual-motor 
task, found that a visual test correlated sig- 
nificantly with the task in early but not later 
trials. Conversely, a kinesthetic test cor- 
related on the later but not the early trials. 
This indicates that initially visual cues guide 
performance, but with experience individuals 
sensitive to proprioceptive cues can become 
most proficient. 

The importance of body cues was also 
demonstrated by a factor analysis of 12 driv- 
ing tests (Herbert, 1963). One of the main 
factors was labeled proprioception because of 
the high loading of a nonvisual driving task. 
Experience with the necessary proprioceptive 
feedback, then, appears to be necessary before 
cue conflict can result. 

Fidelity of simulation would be the degree 
to which the simulated conditions approach 
conditions of the real world. If the simulator 
were of low fidelity no conflict or sickness 
would result since Ss cannot become involved 
in the very unrealistic task. 

The Ss will also not become involved in 
the task if they have a “play set.” Their be- 
havior will not be pertinent to good driving 
performance with, for example, an attempt to 
crash the simulated vehicle. Low involvement 
yields no conflicting cues and therefore no 
simulator sickness. This conclusion is con- 
sistent with the finding that people do not 
become ill in amusement park simulators even 
though all the other necessary conditions may 


be present. 
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Recently Barrett and Nelson (1965d, 1966b) 
evaluated an automobile simulator which had 
all the aforementioned parameters: high 
fidelity, I-O display, and experienced and in- 
volved Ss. They found symptoms which were 
quite similar to those reported in previous 
simulator research, including cold sweating, 
upset stomach, vertigo, dizziness, nausea, 
feeling of faintness and disorientation. About 
half of the Ss became too ill to continue after 
only 5-10 min. Two Ss became so ill as to 
regurgitate. 

A research program was initiated to test 
the cue conflict and conflict sensitivity hy- 
pothesis, as measured with a Rod and Frame 
Test (RFT) apparatus following the field- 
dependence conceptualization of Witkin, 
Lewis, Hertzman, Machover, Meissner, and 
Wapner (1954). Field-independent Ss were 
deemed more sensitive to body cues than field- 
dependent Ss. On this basis it was predicted 
that the field-independent Ss would experi- 
ence more discomfort in the simulated situa- 
tion than field-dependent Ss. 


METHOD 


Automobile Simulator 


An unprogrammed automobile simulator was the 
basic research tool. A terrain model, an 87:1 (HO 
gauge) scale representation of several flat roads, sup- 
plied the visual scene. Mounted above the terrain 
model was a television camera with motions in di- 
rect response to the movements of the brake, ac- 
celerator, and steering wheel of the automobile. Thus, 
S had complete control over the part of the ter- 
rain model that the camera traversed. 

A projected image visual display was used with 
half of the Ss and a virtual image display with the 
remainder, Both visual displays gave the driver ap- 
proximately a 50° horizontal angle view of the ter- 
rain model and a center resolution of approximately 
$00 lines. There was no significant difference between 
the displays in the percentage who left the simulator 
because of discomfort. Therefore, the two displays 
were considered to be functionally equivalent. With 
each visual display the driver sat in the automobile 
and performed the usual control movements as- 
sociated with driving a car. A more detailed descrip- 
tion of the simulator and associated visual displays 
has been reported by Barrett and Nelson (1965c, 
1966a). 

Perceptual style was measured with a standard 
RFT (Witkin et al, 1954). The apparatus consisted 
of a luminous frame (40 in. sq.) pivoted at its 
center so that it could be tilted left or right. Pivoted 
at the same center, but moving independently of 
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TABLE 1 


RELATIONSHIP BETWEEN PERCEPTUAL 
STYLE AND DISCOMFORT 


Rod and frame measures 
Measure of 
discomfort | 


| 

178. | WS: | 1S 1/01 

Discomfort | 14 13 40. 23 
Illness after 22 | 31 | 2 Aas 
No. trials. | 22 | —11 45,25 
Stayed left | . 18 | —.14 | —A7** | —.26 


Note.—S: =Series 1; S: = Series 2; Ss = Series 3; OI = Orien- 
tation Index. 
*p <.01. 
++p <.001. 


the frame, was a luminous rod (39 in.). The S was 
seated 8 ft. from the rod and frame in a chair which 
could be placed in three positions: erect, tilted 28° 
to the left, or tilted 28° to the right. Testing was 
done in a completely darkened room and S wore 
goggles with dark lenses so that he could see only 
the rod and frame. 


Subjects 


A random sample of 50 male Ss between the ages 
of 30 and 45 were selected from approximately 1,200 
employees in an engineering division of an aerospace 
corporation. Approximately 6 mo. after the comple- 
tion of the simulator evaluation, 46 Ss (23 from each 
display) were able to be recontacted and tested with 
the RFT. 


Procedure for Automobile Simulator 
Investigation 


Each S drove three orientation trials around the 
terrain model followed by a pretrial run for a study 
of driving at requested speed. After making the 
tenth and final speed judgment, S was exposed to an 
emergency situation where he had to stop for a 
suddenly emerging pedestrian dummy. The pro- 
cedures have been described in greater detail by 
Barrett and Nelson (1965a, 1966b). During the evalu- 
ation of the simulator, & was observed by the ex- 
perimenter. If S complained of discomfort he was 
told that he was free to leave at any time. 


RFT Procedure 


Approximately 6 mo. after the data emergency be- 
havior study, the perceptual style of Ss was measured 
following the standard procedure (Witkin et al., 
1954). Series 1 (S:) of the RFT consisted of eight 
trials in which $ and the frame were tilted 28° in 
the same direction; Series 2 (Ss) were eight trials in 
which S and frame were tilted 28° in opposite di- 
rections; Series 3 (Ss) consisted of eight trials with 
the frame tilted 28° to the right or left while S re- 
mained upright. The S’s task was to position the rod 
to what he considered to be true vertical by asking 
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the experimenter to move the top of the rod right 
or left. The S’s score was the number of degrees in 
error in each series. In addition, an Orientation In- 
dex (OI) was computed from the standard scores of 
the three series. 


RESULTS 


Four measures of discomfort were com- 
pared to RFT scores. The first measure was 
S’s rating of discomfort, using a 0-10 graphic 
rating scale. The scale was part of a 10-item 
questionnaire concerning the simulator which 
was administered 6 mo. after the simulator 
study. A second questionnaire measure was 
S’s estimate of the length of time after leaving 
the simulator that the discomfort persisted— 
termed “illness after.” Responses to this ques- 
tion ranged from zero illness to 48 hr. Third, 
the number of trials S was able to remain in 
the simulator was used as an index of dis- 
comfort. The range was from 1/4 trial to 
completion (14 trials), Fourth, Ss were di- 
vided into two categories: those who com- 
pleted all trials and those who did not. Twenty- 
three of 46 Ss were able to complete the 
simulator study. 

Linearity was approximated by reciprocal 
transformations of the perceptual style mea- 
sures. Correlations are shown in Table 1 
where it can be seen that the only consistently 
significant relationships were between Ss and 
the four measures of sickness. 

Table 2 shows an apparent threshold phe- 
nomena. The Ss were classified according to 
an adult standardization sample for Ss (Wit- 
kin et al., 1954) with those who were either 
1 standard deviation above or below the mean 
labeled extreme field dependent or extreme 


TABLE 2 


RELATIONSHIP BETWEEN LEAVING SIMULATOR AND 
PERCEPTUAL STYLE CLASSIFICATION 


ON SERIES 3 
D xtreme 
Extreme | pied | Field | “Geld 
Subject inge inde- | depend- depend- 
pendent pendent ent ent 
— 
Left simulator} 12 6 2 3 
(N = 23) 
Remained in 21 2 
simulator 
2 
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field independent. All the extremely field-in- 


dependent Ss left the simulator. 


In order to determine if other aspects of the 
RFT test would significantly add to the re- 
lationship, multiple-regression equations were 
calculated using x, , 1/x, in a-data trans- 
formations of St, Ss, Ss, and OT. No measures 
were found to add significantly to the variance 
accounted for by Ss. 

Mention should be made of the possible rea- 
son for S; being the only measure which was 
related to simulator discomfort. Witkin et al. 
(1954) statistically analyzed 10 perception 
subtests and found three general groupings. 
Sz was not in the same grouping of perception 
tests as were St and Sq, indicating that the 
RFT taps at least two perceptual factors and 
that only one of them was related to simu- 
lator sickness. 


Discussion 


If the conflict of cue hypothesis is correct, 
an obvious remedy to the simulator sickness 
problem is to introduce a certain degree of 
physical motion into the simulated system. 
An interesting question is the degree of mo- 
tion required to give the necessary body cues. 
Simple random vibration may be enough to 
eliminate the cue conflict, a possibility having 
considerable practical and economic import 
for the simulation art. 

Besides the specific cue conflict when mo- 
tion is lacking, the results may have implica- 
tions for motion sickness research, in which 
case motion is present with inappropriate or 
missing visual cues. This is supported by the 
finding that those who experience motion 
sickness were also likely to experience sick- 
ness in a conflict situation (Crampton & 
Young, 1953). It has been reported (Clark, 
1963) that some pilots become disoriented 
while flying in extreme haziness or cloud 
covering. They may be receiving adequate 
motion cues but not the corresponding visual 
inputs. In much the same manner, while flying 
under instrument conditions (again motion 
but no vision), some pilots become dis- 
oriented and mistrust their instruments. 

Simulation of zero gravity in aircraft pro- 
duced illness for 50-70% of the Ss tested by 
Loftus (1963). It is possible that cue con- 
flict experienced in this unusual situation in- 


no signs of zero gravity sickness, while 64% 
of the normal Ss did, supports this hypothesis 


flict in cues. 

It is possible that both cue conflict and 
vestibular stimulation are important, but under 
different conditions, Walsh (1962) 
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oriented, with some experiencing physical dis- 
comfort akin to simulator sickness. The field- 
dependent Ss who function in this manner 
may also experience simulator sickness. To 
probe this possibility the 10 most-field-de- 
pendent Ss were compared as to variability of 
responses. Of the five sick Ss, four were ex- 
tremely variable (range = 20°) and four of 
the five nonsick Ss were very consistent in 
their responses (range = 5°). While the re- 
sults were suggestive, a large subsample would 
be required for statistical confirmation. 
Although cue sensitivity in terms of per- 
ceptual style appears to explain the extreme 
discomfort that some people experience, Wit- 
kin, Dyk, Faterson, Goodenough, and Karp 
(1962) recently discussed perceptual style in 
terms of being able to extract an item from 
an embedded context. The results of the study 
could be explained in these terms also. In a 
simple laboratory study of kinesthetic sensi- 
tivity where cues are isolated there may be 
no differences between field-dependent and 
field-independent Ss, However, in the com- 
plex simulator (an embedding context) the 
field-independent person may be more aware 
of the cues which are in conflict (i.e., can 
disembed them), and thus he becomes ill. 
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The aim of this research was to try out with 
electronic technicians a new 


the program by means of overt responses. 
One type and format of programmed in- 
struction, due to Crowder (1960), is the fol- 
lowing: A page of instructional material is 
ended by a question with several multiple- 
choice answers. Only one of these choices is 
correct. The choices are answered and dis- 
cussed on separate pages distributed at ran- 
dom throughout a given chapter. If a student 
selects a wrong answer, the wrongness of that 
particular answer is discussed in view of the 
instructional material and the student is 
usually returned to the original instructional 
page to reread it and to try again. If a student 
selects the correct answer, he is told that this 
is the correct answer and an explanation is 
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TABLE 1 
List AND DESCRIPTION OF VARIABLES 
Variable Level 
Fixed 

Sequence of forms in pretest to posttest AE 
Book format, linearized or scrambled | 2 

Card participation, preperforated IBM cards | 
used to record answers, or not 2 

Random | 
Subjects, electronic technicians 25 

| 


Posttests were developed and given to measure 
achievement gain, In addition to developing the 
linearized version, the scrambled version was re- 
vised on the basis of previous tryouts, 

The experimental design follows (Winer, 1963). 
(See Table 1.) Two levels of three independent 
variables were used in a factorial design with five 
subjects (Ss) per cell. The first independent variable 
was achievement test sequence: this was done to 
control for a known minor difference in test diffi- 
culty. The second independent variable was the 
format, linear versus scrambled; half of the students 
got one version, and the balance the other. The next 
independent variable was student participation, to 
check on the hypothetical superiority of the 
scrambled-book version, The Participation condition 
had half of the students using IBM preperforated 
cards for the recording of answers to the book ques- 
tions, whether they were using the linearized or 
scrambled version. The rest of the students had no 
such cards and thus Presumably got themselves less 
actively involved in the book, 

“Subjects” was the last independent variable. The 
Ss were all electronic technicians who had been em- 
ployed for a varying number of years at Collins 
Radio at Cedar Rapids, Iowa and had different ex- 
periential and educational backgrounds, The Ss were 
assigned serially to one of the eight treatment condi- 
tions as they became available to the study. It was 
hoped to get six Ss for each of the eight treatment 
cells, but several people were not able to complete 
the book and the posttest satisfactorily, Luckily these 
people with incomplete records did not occur in the 
same treatment group so that in the final analysis 
there were a total of five Ss for each of the eight 
treatment cells. Because of a limited number of 
text books, it was desirable to split the training-test- 
ing phase of the experiment into two sections, Ac- 
cordingly about half of the test Ss were trained and 
tested in the summer of 1966 and the second half in 
the winter of 1967. Data from the two sections were 
pooled. 


RESULTS AND Discussion 


The median time for technicians to com- 
plete the book on their own time was 12 hr., 
with extremes ranging from a minimum of 9 


D. H. SCHUSTER 


to a maximum of 40 hr. The use of card par- 
ticipation did not particularly affect these 
times. The time involved in pretesting was ap- 
proximately 1 hr. on company time with an- 
other hour likewise in posttesting. 

An analysis of variance was made of the 
gain in test scores from pretesting to post- 
testing. The posttest scores themselves could 
have been analyzed but were subject to pos- 
sible sampling biases in pretest technician 
ability in the eight treatment cells. This could 
have been corrected by an analysis of co- 
variance with the two covariables of pretest 
score and format. The use of gain scores ap- 
peared simpler. Consequently, the analysis 
basically rests on the gains in test scores, as 
shown in Tables 2 and 3. 

Table 2 presents the analysis-of-variance 
summary for the test gain scores, The basic 
datum for each S was to subtract his pretest 
score from his posttest score and then to 
analyze these resultant gain scores. The only 
significant treatment in Table 2 was that for 
test sequence. This was expected as the pre- 
liminary studies had shown that Form A was 
harder than Form B. This is reflected in Table 
3 where it is seen that a greater gain occurred 
when Form A was given as pretest and Form 
B as posttest. This, of course, is to be ex- 
pected if Form A were harder than Form B. 
Neither the main effect of the book format 
nor of card participation was significant. Nor 
were any of the interactions significant. This 
lack of significance is also reflected in Table 
3 where. the averages are given for the treat- 
ment conditions. For the difference in book 


TABLE 2 


ANALYSIS-OF-VARTANCE SUMMARY FOR 
Test GAIN SCORES 


Treatment and effect | SS | df | MS R 
Test sequence (A) 168.10 | 1 | 168.10 | 13.74* 
Book format (B) 14.40 1 14.40 1.18 
Card participation (C) 0.90 1 0.90 | 0.07 
AXB 8.10] 1] 8.10} 0.66 
AXC 14.40 1] 14.40] 1.18 
BACI 4.90 1| 490 0.40 
AN BNC 10.00 | 1 | 10.00 0.82 
Within 391.60 32 12.24 — 

Total“ 612.40 39 


0. 


EVALUATION oF Procsamuxen Inerauction 


TABLE 3 
Sommasy ov Gar Scomzs sy Texarexnt 


Test sequence 
Form A first and B last 
Form B first and A last 


Book format 
Linearized 
Scrambled 


Card participation 
No cards used to record answers 
IBM cards used to record answers 


Note.—Form A = linearised; Form B = scrambled, 
oN = 20 data per treatment averar. 


format, linearized versus scrambled, not only 
was the scrambled-book format not superior 
to the linearized-book format, as hypothesized, 
but a little bit of the reverse While 
insignificant, the linearized format had a 
slightly greater gain achievement than did 
the scrambled format. The interpretation can 
be only a conjecture at this point. The linear- 
ized format with its facilitation of review and 
easy coverage of all answers was just as ef- 
fective as the scrambled-book format with its 
somewhat greater participation supposedly. 
Clearly participation is not part of the answer 
as the effect of card participation was in- 
significant although slightly in the expected 
direction; that is, students using cards to 
record their answers did have a slightly higher 


achievement gain than students without the 
card participation. Or perhaps greater ir- 


ritation went with the participation in the 
scrambled book. 

Two other studies are relevant at this point. 
Brown (1966) varied the location of the 
answer in four different versions of the same 
programmed instructional package. i 
answer locations were “the next line down” 
or “on back of the page.” He found that the 
formats with an easy opportunity to cheat re- 
sulted in less achievement than where the 
opportunity to cheat was less frequent or 
harder. Kapel (1966), on the other hand, 
found that cheating did not have this detri- 
mental effect. Cheating, defined in terms of 
opportunity to review material or to search for 


posedly greater participation on the part of 
the students using the scra 
Perhaps the answer does lie in the easy avail- 
ability of all answers in the linearized format 
for review and optimal search purposes. 


There was one me flaw in the 
present study, namely that there was no con- 
trol group used to evaluate the test-retest 
effect itself. It is fairly well known that a 
person will improve on an achievement test 
if he takes the same test of factual knowledge 
on a second occasion. This is not the case 
with esoteric subject matter as has been dem- 
onstrated in other studies. The assumption 
was made in this study that the amount of 
improvement in knowledge about electronic 
test equipment would be very slight in the 
5. or 6-wk. period between testing unless S 


TABLE 4 


Format PREFERENCE VERSUS FORMAT 
STUDIED BY TECHNICIANS 


p 


Format preference 


Format studied 
Linear 7 Scrambled 
Linear 23 2 1 
Scrambled 18 4 3 
Totals 41 6 4 


— — 
„4.23, 08 by grouping "?” and “Scrambled.” 


312 


had indeed gone through the textbook. If 
the study were to be redone, it would be ad- 
visable to include a control group to evaluate 
the test-retest effect over a comparable time 
period even though it might be hard to obtain 
Ss for this purpose. 

Reader preference for one book format 
versus the other is a final interesting consid- 
eration. As students came in to take the post- 
test, they were asked to fill out a short ques- 
tionnaire. One of the questions was “Read an 
item or two in the other book format than the 
one you had. Which do you prefer, linear or 
random? Why?” The student tabulations to 
this question are given in Table 4, Ignoring 
the question mark response, these results 
show an overwhelming 41:4 opinion prefer- 
ence in favor of the linear format. The sig- 
nificant chi-square value means that students 
going through the linear book preferred it 
more than did students going through the 
scrambled book. Even though there was no 
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difference in achievement between the two 
book formats, the tremendous opinion prefer- 
ence for the linearized version has a very 
practical consideration in terms of student 
usage. Consequently, the decision was made to 
go ahead and publish the linear edition 
(Schuster, 1968). 
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The logic and development of a technique for evaluating job performance 
directly in terms of organizational objectives without reference to normative 
materials are described. The technique involves the establishment of critical 
performance levels, as derived from a Guttman scale, in terms of stated or- 
ganizational objectives and relating performance to these levels. The retest re- 


liability, internal consistency, predict 
technique are described. 


Throughout the history of psychological 
and educational measurement, emphasis has 
been placed on the use of norms for interpret- 
ing test scores. It has been commonplace to 
encounter the statement that a given raw 
score on a test is meaningless until it is com- 
pared to scores made by other people. Text- 
books have usually stressed the use of per- 
centiles, standard scores, age norms, etc. 

Several recent developments, however, have 
suggested that there are other, less norma- 
tively oriented methods for deriving meaning 
from test scores. For example, the elaboration 
of scaling theory has drawn attention to the 
internal relationships of the items of the 
score scale itself, somewhat aside from the 
people from whom the original score data 
were obtained. Also, systems technology has 
concentrated on the importance of the func- 
tioning of the system as 4 whole and has 
evolved concepts of personnel subsystem de- 
sign and evaluation which relate human per- 
formance directly to system performance. In 
this context, Glaser (1962) has contrasted 
the use of “criterion-referenced” measures, 
which tie scores directly to a given criterion 
behavior, with “norm-referenced” measures, 
which are interpreted in terms of the score 
distribution of a representative group of peo- 
ple. With the growing sophistication concern- 
ing the problem of assigning meaning to test 
scores, even current standard textbook treat- 


1 The research reported in this paper was per- 
formed under Contract Nonr-4153(00) between the 
Office of Naval Research and Applied Psychological 
Services, The cooperation and encouragement of 
Victor Fields, Sidney Friedman, Charles Hodges, and 
Martin Wiskoff, Bureau of Naval Personnel, are 
gratefully acknowledged. }; 
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ability, and concurrent validity of the 


ments of the issue include mention of such 
matters as types of scales and expectancy 
tables (e.g., Cronbach, 1960, pp- 69-73). 

The purpose of the research reported here 
was to develop a performance appraisal pro- 
cedure which would relate each individual’s 
score to a standard, external to the body of 
personnel, that is, a standard defined by the 
criterion performance required in order for 
the particular element of the industrial or 
military organization to accomplish its or- 
ganizational objectives. 


METHOD 


The job constellation under study was the family 
of enlisted positions associated with naval air elec- 
tronics (avionics) work, and the method employed 
consisted of the following steps: 


1. Formulation of statements of the specific sys- 
tem objectives of naval air electronics and determina- 
tion of the importance of their respective contribu- 
tions to system requirements ; 

2. Establishment of critical points on a Guttman- 
type job-performance scale, the critical points repre- 
senting different levels of satisfying the objectives; 

3. Estimation of the reliability of the instrument, 
derived in Step 2, through fleet administration in six 
operational squadrons; 

4, Investigation of the extent to which the cri- 
terion instrument, developed in Steps 2 and 3, is re- 
lated to various Navy predictor instruments and a 
job-sample performance test. 


Definition and Importance of Objectives 


A preliminary list of the objectives of the naval 
avionics personnel subsystem was prepared following 
a search of a wide variety of documents concerned 
with naval and electronic maintenance activities. 
The search included examination of such sources as 
bulletins of training course offerings, including pre- 
requisites and purposes of the courses, volumes giving 
descriptions of the duties of the various occupational 
specialties, and equipment operation and mainte- 
nance manuals. Throughout such literature there 
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This preliminary set of objectives was reviewed 
with 18 individuals in high-level assignments involy- 
ing clectronics maintenance, training, and supply. 
Fourteen of the individuals were attached to the 
Staff of the Commander of Naval Air Forces in the 
Atlantic Fleet; the remaining four were members of 


when equipment is operated, that is, output charac- 
terized by precision and variability appropriate to 


2. Performance: To complete any given mission in 
minimum time at appropriate level of accuracy and 


3. Preparation: To Prepare for personnel require- 
ments of présent and future equipment, systems, and 


situations through use of training programs, mainte- 
nance of high morale, etc; 


4. Readiness: To efficiently maintain self, equip- 
„5 


Items 


ment, and systems in a state of readiness consistent 
Fleet H 


S. Safety: To carry out duties with maximum pro 
tection for men and equipment consistent with mis. 
sion, 


Estimates of the relative importance of these ob- 
jectives were needed so that they might later be 
combined in the formulation of overall goals and 
the resulting establishment of single critical per- 
formance points in addition to critical performance 
points for each objective separately. The objectives 
were scaled along an importance dimension by the 
constant-sum method described by Comrey (1950), 
one of the fractionation methods which produce ratio 
scale values. The judgments required for the scaling 
analysis were obtained from 57 commissioned officers 
and 1 warrant officer, from 30 different operational 
squadrons, all of whom had current responsibility 
for avionics activities, Scale values were derived in 
accordance with Torgerson’s (1958, pp. 108-112) 
modification of Comrey’s analytical Procedure. This 


Establishment of Critical Points on 
Generalized Job- Performance Scale 


Prior research (Schultz & Seigel, 1961) had de- 
veloped an 8-item hierarchical group of electronic 


been found to Yield a coefficient of reproducibility 
of .94, and an index of consistency (Green, 1956) of 
35. The items of this scale ate shown as follows; 
the scoring values associated with the items are con- 
secutive integers from 1 for “Employing safety pre- 
cautions on” to 8 for “Employing electronic prin- 
ciples involved in maintenance of.” 


Employing electronic Principles involved in maintenance of. 
Troubleshooting /isolating malfunction(s) in N 
Calibrating . . . 


bra 35 


Knowing relationship of equipment to other related 
Following block diagrams for 
Removing . . . 


Employing safety Precautions on. 


Critical Points on this scale Were established by 
adapting the procedures of the Psychophysical method 
of limits. Two threshold points were sought for each 


61 noncommissioned officers in 23 operational squad- 
rons who were supervisors in the avionics specialties.a 
i The scale was reproduced on each page of a ques- 
tionnaire and was headed by one of the five objec- 
tives. The Ss i i 
were instructed that the scale items 


* Aviation electrician’s mate, aviation electronics 
technician, aviation fire control technician, aviation 
antisubmarine warfare technician, and training de- 
vices man. 


Assovutr Sata or Jos Prarommance 
TABLE | 


on Eacu Navar Aviosscs Ospecnive 


6. 1.70 | 7.11 
3.84 | 1.62 | 457 
S 


Minimum acceptable 
Descending 4.37 | 1.27 | 4.21 | 1.74 | 475 | 176 
Ascending 224 | 1.16 | 263 | 1.53 | 339 | 1.79 

M 3.31 342 407 


Note.—N = 61 senior noncommissioned officers in avionics ct 


most of the equipment typically in- two sets of judgments of 


ascending and descending series, and the final thresh- column of 
old value is taken as the midpoint between the 
threshold values arrived at in the two series. The 
cedure in this study was to take Ss, page by 

through the questionnaire. As they considered 
objective, they were asked to indicate: (a) 
on the scale which described the lowest 

they could make some use of in meeting the 
and (b) the place which described the 
man who would be a desirable journeyman 
sense that he would accomplish his assigned 
as to fully achieve the objective. This constitu 
ascending series. Then, on additional pages, they 
asked to indicate: (a) the place on the scale which 


3 


BP 
725 

iz 

21 

i 

£ 


Ès 


minimum 


were to be interpreted as completing the blank in minimally acceptable perf 
the sentence: “A technician at this level is capable mum acceptable critical point, and 


8. 
Fs 
Ẹ 
EJ 
2 
g 


8 
as 


lish threshold points, stimuli are presented in both objectives was also computed and ls 


tion objective was felt to call for 
jective 
they could not make betence and the performance o 
i feature of the data in 
some use of in meeting the objective, and (b) the 5 155 — 


as being a desirable journeyman in meeting the ob- i precise points but, 

jective. This constituted the descending series. eee eee the 00% scale. or else 
Employing the scoring values associated with the qusiens, which were quite different in the two 

items that the Ss had checked, the mean of the ated of these judges do not reflect t 


ascending series was computed over all Ss for the proaches to the same point. 


é indicated points of minimally acceptable and de- 
* sirable performance on each objective, as was the Reliability 


2 f 
mean of the descending series. Then, the ite value The so performance scale, prepared 


ted, and its value 
these respective means was compu! ais 1, was employed 


was taken as the critical point on the job-per- 
formance scale. This resulted in two critical scores 

z for each naval avionics objective: the mean of the 
f ascending and descending judgments of the point of 


supervisors in evaluating 


a total of 
avionics technicians. Eight weeks 


E 
| 


| 
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TECHNICAL PROFICIENCY CHECKOUT FORM 


The mon roted on this form should be o striker or petty officer, third closs, in one of the following 
ratings: AE, AT, AQ, TO, AX. 


Listed below ore eight tasks thet © man in one of these ratings moy be called upon to perform, 

You ore to indicate whether the mon being roted is checked out on each tosk, i.e., is capable of 
doing the tosk “on his own" without direct supervision. Indicate your decision by placing on 
either In the "CHECKED OUT™ or "NOT CHECKED OUT" box to the right of each tosk descrip- 


tion, 


NOT 
TASK DESCRIPTIONS CHECKED OUT CHECKED OUT 


1. Copable of . safety precovtions on most of this 
squedron's equipment wi ich his rating is concerned. o o 


2. Copoble of replocing most of this squodron's equipment 
with which his rating is concemed. 


3, Copoble of removing most of this squedron's equipment 
with which his rating is concerned, 


4. Copoble of following block di for most of this 
squodron's equipment with which ki rating is concerned. 
5. Copoble of knowing relationship of equipment to other 
reloted equipment with which his rating is concerned, 


é. Copoble of calibrating most of this squadron's equipment 
with which his rating is concerned. 


7. Copeble of trouble shooting/isolating malfunction(s) in 


most of this squadron's equipment with which his rating is 
concemed. 


ER E 
ee T a T a 


8. Capable of employing electronic principles involved in 
maintenance of most of this squadron's equipment with which 
his rating is concemed. O oO 


MAKE CERTAIN THERE IS AN "X" IN A BOX NEXT TO EACH TASK DESCRIPTION 
Nome of supervisor 


Rating 
222 et pln AL orl, 

Rating 
— ĩì“ “] Al 


Location Date 


Nome of mon rated 
Squadron 


Months supervisor hes known man 


— 


Fic. 1. Appraisal form based upon job-performance scale, 


nicians were reevaluated by the same supervisors and job-performance measuring instrument over an 8-wk. 
the product-moment correlation coefficient was com- period. 

puted between the scores the technicians received on An instrument which possesses the characteristics 
the two evaluations. This correlation coefficient was ofa Guttman scale is internally consistent. However, 
97, indicating a satisfactory level of stability for the to determine the effect upon the internal consistency 


f this instrument of application to a different sans- 

than the one on which it was originally developed, 
‘data from the sample employed for stability 

non were also subjected to reproducibility and 
cy (Green, 1956) analyses, Table 2 presents 
of these analyses, showing the respective 
re dbility and consistency values in association 

ith those obtained in the original development of 
the scale. It is seen that the scalability of the items 
not deteriorate upon re-use. 


hips to Predictors and Performance 


4 Copies of the Technical Proficiency Checkout 


m and a biographical information form were sent 


fo each of 13 squadrons of the Atlantic Fleet. These 


accompanied by a request from the Bureau of 
h Personne! that the Technical Proficiency 


Checkout Form be completed by immediate Une 


sors of the journeyman avionics technicians 


ö ‘the unit and that the biographical information be 


ded for each man rated. The biographical in- 


*s Basic Test Battery, final grades in the par- 
r Basic Specialty Training School attended, 


‘and final Basic Specialty Training School class stand- 
commonly 


derived cut points for the combined objectives and 
each of the predictor variables. Table 3 presents 


these correlation coefficients. The Ns vaty for the 
various coefficients because of incomplete data for 
some cases. 

In order to assess the relationship of the Technical 
Proficiency Checkout Form to another measure of 
job performance, a job-sample test of knowledge 
and skill in electronics was constructed. The first 
part of the test required the technician to identify 
the safety violations shown in nine pictures. Part 2 
consisted of 11 questions concerning seven block 
diagrams of electrical or electronics equipment. In 
the third part, the examinee was asked to trouble- 
shoot four malfunctioning electrical circuits in a 


TABLE 2 


REPRODUCIBILITY AND CONSISTENCY or 
Jon-PERFORMANCE SCALE 


Reproduci- Index of 
Set bility consistency 
fee — 
Original development* 94 55 
First administration” 97 57 
Second administrationb 98 70 
aN = 181. 


bN = 64, 


Arsot Ur Scauinc ov Jos Prerommance 


TABLE 3 


specially designed 
score on the test, the three parts were 


proportion in 
cach of the three performance levels, as measured by 
Proficiency Checkout Form, was “de- 


performance but that it is 
moderately related to another type 
formance measure. Such a picture is common in the 
study of job-performance criteria. 


fectiveness of that proficiency level for ac- 
complishing the overall mission of the avionics 
technician in the Navy. Having established 
such a relationship, the score of any individual 
technician could then be interpreted in terms 
of the expected absolute level of output of 
that individual, that is, the absolute level of 
his usefulness to the fleet. The checklist, 
called the Technical Proficiency Checkout 
Form, had a satisfactory degree of retest re- 
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liability, and it retained the Guttman scal- 
ability properties across two independent sam- 
ples (original development administration and 
two administrations in this research). While 


performance 
test. In terms of the general goals of this re- 
search, it seems that the establishment of a 
reliable, valid, behaviorally based evaluative 
technique, that is not normatively oriented, 
has been at least partially achieved. It is also 
believed that the methods employed are ap- 
plicable to performance evaluation in other 
contexts and job families. 
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Comaxy, A. L. A proposed method for absolete 
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F. A method of scalogram analysis using 
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skills in job performance. Journal of Applied Piy- 
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NR Nu, 
COMPARISON OF OPERATOR PERFORMANCE ON A 


TRACKING TASK USING DISCRETELY AND 
CONTINUOUSLY VARIABLE CONTROLS 


M. HAMMERTON axe A. H. TICKNER * 
Medical Research Council, Applied Prycholegy Research Usi, 
Cambridge, Enghand 


An account Is presented of an investigation devised to extend compurien of 
operator performances using discretely and conta te 


trols. 2 groups, of 12 Ss each, learned 1 


held equally well for acquisition tasks, and 
to investigate the relative ease with which 
the two types of control were mastered 


METHOD 


position to a target, represented by a circle 3 mr. in 
diameter, and to keep it within that target for 2 sec., 


continuously. The Ss score was the time taken to scores of 


achieve this. 

The S exercised control using a thumb joystick: 
the position of the stick determined the 
of the spot. With the continuous control stick. 
maximum acceleration—given by full deflection either 


varying 

acquisition tasks, and to compare the ease with which Ss mattered the 
control to a stable level, then 

ferred to the other, which was also practiced to a wee 
that the discretely variable control was harder to aod 
to change over to (p < 01) when used to the other, although it was 
that there was no significant difference between 


(b) The overall group mean of the last five 


Ade of center--was 7 cm/sec’; the sense of the group mean 

Acceleration was the same as the direction of the de- a stable level 
flection of the stick from its (lightly spring-loaded) is the number of posttransfer trials required 
central position; the acceleration varied linearly with 0 reach a stable performance level with the 


geometry as the continuous one, but to give an out- 
put the stick had to be in one of five spring-loaded 
indent positions. These positions were at the center 


all—that is, after transfer. 


stick. 
(b) d= 100(1 — T/L), where T is the 
f the first posttransfer score, 


of travel, at either extreme, and halfway to cithe and L is the group mean of the last pretrans- 


extreme. (The total travel was the same in 
sticks.) The output from these step positions was 0 
in the center, 2. 35 cm/sec” at the halfway points, 
and +.7 cm/sec” at the extreme points. 
Procedure. Two groups of 12 RN. ratings acted 


as Ss for this experiment, The first group was trained SAk 4 20 


1 The authors would like to acknowledge the help 
they received from F. W. Roberts in preparing and 
maintaining the apparatus. 
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measure di = 93%; first-shot transfer mea- 
sure d, = —18%; initial learning measure m; 
= 28 trials. 

It was found for Group 2: stable group 
mean score with continuous stick = 10.38 
sec.; posttransfer stable group mean score 
with stepped stick = 11.04 sec.; transfer sav- 
ings measure a; = 80%; first-shot transfer 
measure d = —226%; initial learning mea- 
sure mz = 20 trials. 

t tests indicate that none of the stable 
values differs significantly one from another 
at the .05 level. Applying U tests to the in- 
dividual a, u, and d values, however, indicates 
that: 


az < a (p < 05, one-tailed) 
dz <d, (p < 01, two-tailed) 
na <n, (p < .05, two-tailed) 


M. Hawerton anp A. H. TICKNER 


CONCLUSIONS 


Hunt’s (1966) finding that there is little to 
choose between the two varieties of control in 
steady operation is confirmed. It is evident, 
however, that it is more difficult to master the 
stepped stick, and more difficult to take it up 
when used to the other. Consequently, it would 
seem that the adoption of stepped as op- 
posed to continuous controls is not to be 
recommended. 
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A e. 
VEHICLE KINEMATICS AND GAP ACCEPTANCE ' 


PAUL M. HURST, KENNETH PERCHONOK,* aw EDMOND L. SEGUIN * 
Institute For Research, State College, Pennrylvanis 


Data from traffic photographs under field conditions were analysed to isolate 
those variables determining gap acceptability in a merging situation. 78 akterna- 
tive expressions were computed, involving positional and velocity information 
for lead, following, and subject vehicles. Regarding each of there indices as a 
correlate of gap acceptability, correlation ratios were computed between ac- 
ceptance-rejection behavior and the value of each index at the “decision point.” 


Values of u ranged from .187 to .733. Each of these values, except the lowest, 
was significant at p<.01, In general, the highest correlations were obtained 
with expressions involving time or speed relationships between subject and fol- 
lowing vehicles. Implications are presented for the use of an index and its cor- 
responding u values in the assessment of comparative degrees of “confusion” as 
a function of roadway and/or traffic-control parameters. 


Knowledge of factors influencing driver be- 
havior is necessary for control and prediction 
of traffic characteristics. In recent years, at- 
tention has been given to the effects of inter- 
relationships between vehicles upon the behav- 
ior of the drivers concerned. One such setting 
is the interaction between two vehicles form- 
ing a gap and a third which may or may not 
accept that gap. In this situation, the driver 
of the potentially merging vehicle may be 
thought of as judging the relationships be- 
tween the three vehicles along a single dimen- 
sion so as to make a decision to accept or re- 
ject the gap. 

This paper is addressed to the problem of 
determining those combinations of vehicle kin- 
ematics—vehicle separation, relative speeds, 
and acceleration potentials—which best cor- 
relate with observed gap acceptance/rejection 
behavior. The setting was one in which lane 
changing was necessitated by blockage of the 
medial lane on an urban expressway. 

In order to determine the predictive power 
of a given combination of kinematic factors, 


1 This investigation was supported by Public 
Health Service Research Grant No. AC-66, from the 
Division of Accident Prevention. The authors wish 
to thank Edward Gervais and Frank DeRose, Jt» 
and their associates on the John C. Lodge Freeway 
Traffic Surveillance and Control Research Project, 


Detroit, without whose cooperation this study could 


not have been done. 2 4 
2Now at Cornell Aeronautical Laboratories, Buf- 
falo, New York. 
3Now at HRB-Singer, Inc, State College, Penn- 


sylvania. 


called gap index (g), reliance was given pri- 
marily to , the correlation ratio. Two qual- 
ities desired in a gap index are (a) that dif- 
ferent values of the index yield different re- 
sponses, and (b) that different drivers respond 
similarly to the same value of the index. The 
degree to which these criteria are met reflects 
the degree to which drivers apparently utilize 
the gap index under study to make their de- 
cisions with respect to merging. The value of 
is not dependent upon the particular nature 
of the functional relationship (e. g., linearity), 
but rather measures the total degree to which 
the stimulus controls the response. When 
squared, ņ measures the proportion of accep- 
tance/rejection behavior accounted for by the 


gap index. 
PROCEDURE 


A three-lane section of the John C. Lodge Freeway 
in Detroit was the site of the data collection, From 
the roof of a building adjacent to the freeway a fixed 
observation area, approximately 350 fl. in length, 
was photographed with a 35 mm, camera at the rate 
of eight frames per second. Alternate frames were 
used in the analysis, producing a four per second data 
rate. During the observation periods a maintenance 
vehicle had been moved into the downstream end 
of the observation area, closing the medial lane and 
thus forcing vehicles in this lane to merge into the 
adjacent lane. The photographs of resulting vehicle 
behavior were reduced to usable quantitative form 
by processing through the Pictorial Data Transducer 
system developed by the Institute For Research 
(Hicks, 1964). This system is a film-scanning device 
coupled with an IBM 024 card punch which enables 
an operator, by positioning a pair of cross-hairs, to 
transform photographic information into digital posi- 
tion data in an XY coordinate system. The punched- 
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Fic. 1. Diagram of a gap occurrence at the decision 
point. (S = subject vehicle; F = following vehicle in 
adjacent lane; L kad vehicle in adjacent lane; 
MV = maintenance vehicle blocking medial lane; Ls 
= length of subject vehicle; Xy = position of follow- 
ing vehicle; XI. S position of lead vehicle; Xs = 
position of subject vehicle.) 


card output is then processed through a battery of 
special-purpose programs using IBM 1620 and 7074 
computers, The computer output consists of point- 
by-point positional and velocity information for all 
vehicles in the observation area. For a more complete 
description of this system, see Hicks (1964). 


Measures 


From the film segments chosen for study, a total 
of 107 gap occurrences were analyzed, including 68 
acceptances and 39 rejections. This number does not 
reflect the actual occurrence rate, since to be included 
in the analysis an “occurrence” had to take place en- 
tirely within the observation area. That is, the lead 
vehicle had to pass the subject vehicle, and the sub- 
ject vehicle either enter the gap or be passed by the 
following vehicle, with all these events occurring 
within the 350-ft. interval. This restriction was neces- 
sary in order (a) to insure that any gap occurrence 
labeled as “rejected” was observed over the entire 
time interval during which acceptance could have 
occurred, and (b) to permit measurement of all ap- 
propriate kinematics at a common point, the “de- 
cision point.” To control motivation, only gaps pass- 
ing the subject vehicle were considered; gaps which 
the subject overtook and Passed were not counted as 
occurrences, 

The following measures were computed at the “de- 
cision point,” defined as the instant (to the nearest 
t sec.) that the lead vehicle defining a gap passed a 
subject vehicle in the closed lane, when the configura- 
tion was shown in Figure 1: 

MV = maintenance vehicle blocking medial lane. 

S = subject vehicle, 
L = lead vehicle in adjacent lane. 
F = following vehicle in adjacent lane. 

Xs = position of subject vehicle (front, longitudi- 
nal coordinate), 

Xt = position of lead vehicle (rear, longitudinal 
coordinate). 

Xr = position of following vehicle (front, longi- 
tudinal coordinate). 

Ls = length of subject vehicle, 

In addition, the longitudinal component of velocity 
at the decision point was obtained for each of the 
three vehicles in question: 


P. M. Hurst, K. PercHonok, ann E. L. SEGUIN 


Vs = Velocity of subject vehicle. 

Vi = Velocity of lead vehicle. 

Vr = Velocity of following vehicle. 

From the foregoing measures, it was Possible to 
compute the following terms which were utilized in 
the 27 alternative expressions listed below. Computed 
decelerations are based upon the assumptions of a 
reaction time of Ẹ sec. and a constant frictional co- 
efficient of 6. 

Hr = Xi — Xr — Ls = Distance headway antici- 
pated for following vehicle. 

Sr = 75 Ve + Vr?/38.4 = Stopping distance required 
for following vehicle. 

Tr =.75 Vr/19.2 = Stopping time required for fol- 
lowing vehicle. 

Sr-s = .75 Vr + [(Ve*— Vs?) /38.4] = Distance re- 
quired for following vehicle to match velocity with 
subject vehicle if the latter does not accelerate after 
the decision point. 

Tr.s 75 + (Vr — Vs)/19.2 = Time required for 
following vehicle to match velocity with subject 
vehicle if the latter does not accelerate after the 
decision point. 

These terms were utilized in 27 alternative gap in- 
dices; a twenty-eighth index, TH, the time headway 
between the subject of following vehicles was also 
studied. The indices are listed below, with an accom- 
panying value of the correlation ratio, 7, expressing 
the correlation of each index with gap acceptance/ 
rejection behavior. The same 107 gap occurrences 
form the basis for each calculation. 

It should be emphasized that each of these alterna- 
tive indices, with the exception of TH, uses only the 
information Potentially available to the driver at the 
Point in time arbitrarily selected as the “decision 
point.” For example, Hr / Vr is really a measure of 
projected time headway based on the assumption of 
constant Fr. Thus, Hr/Vr is the “time headway” 
projection actually available to the driver at his de- 
cision point. 


RESULTS 


As can be seen from Table 1, the values of 
n ranged from. .187 to .733. The data were 
analyzed for statistical significance using an 
arcsine transformation for the proportions of 
acceptances in each interval. From these 
data, it is possible to compute a between-cell 
sum of squares with an approximate chi- 
square distribution. The transformation was 
necessary because the dependent variable, ac- 
ceptance/rejection, is dichotomous. 

The »=.187 value was not significant; 
each of the other values was significant at 
P= 01. 


Discussion 


Although some of these indices of gap ac- 
ceptability have been studied by others (cf., 
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TABLE ! 


CORRELATION RATIOS FOR VARIOUS INDICES OF ¢ 
WITH GAP ACCEPTANCE/REJECTION 


Expression for g 2 
1. Vs 261 
2. Vi 187 
3. Vr 479 
4. He 443 
5. Sr 479 
6. Ir 479 
7. Sr-s 473 
8. Tr-s 491 
9. He/Ve 511 
10. H/ (Vr — Vs) 892 
11. Hr/ Sy 537 
12. {Hr/Vr}/Tr 542 
13. (H/ (Vr — Vs)}/Tr .620 
14. Hy/S¥-s 885 
15. (H/V Tacs 573 
16. {Hr/(Vr — Va)}/Tr-s 693 
17. Va — Vu 470 
18. Ur — Sr 526 
19. (Hr / Vr] — Tr 494 
20. (Hr / Vr — Vs)} — Tr 632 
21. Hy — Sr-s 578 
22, (Hy/Ve} — Des 733 
23. Hr / Vr — Vs)} — Tr-s 605 
24. |Vs — 1/2(Vr + Vi) 500 
25. (Vs — Vr)/Hr 601 
26. Hr/|Vs — 1/2(Ve + V S71 
27. |Vs — 1/2(Vr + YH 533 
28. TH 60 


e. g., Glickstein, Findley, & Levy, 1961), 
direct comparison of results across studies is 
perilous. The major factor tending to distort 
such a comparison is the influence, upon n, 
of the range of dispersion of the stimulus 
variable. Since » does not specify the func- 
tional relationship between stimulus and re- 
sponse values, it is impossible to adjust it for 
range effects as is often done with the Pear- 
son r. Vet such effects probably do operate in 
a strong if indeterminate manner. 

Thus, the proper comparison of 5's is pos- 
sible only when the same sample of behavior 
is involved in each computation. In the data 
from the present study, a fixed sample of oc- 
currences and acceptances, defined by points 
on photographic film, formed the basis for 
comparing all of the alternative predictor ex- 
pressions. Were this sample to be augmented 
by a number of very large gaps which were 
generally accepted, and by a number of very 
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small gaps which were generally rejected, the 
effect would tend to increase the j obtained 
for any measure having a nonchance relation- 
ship with acceptance/rejection behavior. It 
would still be appropriate, however, to make 
comparisons within the new sample between 
y's obtained for various alternative measures 
of gap size. 

Restricting consideration, therefore, to with- 
in-sample comparisons, several interesting ef- 
fects are to be found. First, note that physical 
gap size or vehicle separation, Hy, was a 
rather poor predictor of acceptance/rejection 
(4 = .443, the third from the lowest value ob- 
tained). Time headway, TH, was better (4 = 
.606, the fifth in rank of magnitude). Expres- 
sions involving the dynamics of the lead 
vehicle (L) generally yielded low y's. The 
four highest values include the terms Hy and 
Vp. The two highest of these, No. 22 and No. 
16, involve a Ts term which corresponds to 
time required for F to match velocity with S. 
In No. 22, Tr s is subtracted from the time 
headway of F with respect to L, whereas No. 
16 represents the ratio between collision course 
time and Tps. This collision course time re- 
fers to the time required for F to reach S if 
S were to enter the gap at the decision point, 
in a position immediately behind that cur- 
rently occupied by L. A certain strategy on 
the part of S’s driver is suggested here: he is, 
perhaps, concerned with the ability of F to 
accommodate him by decelerating before F’s 
headway is exhausted. This could be a con- 
sideration regardless of whether S’s driver in- 
tended to contribute to the velocity matching 
by accelerating his own vehicle. 

Recalling the admonition against direct (as 
opposed to relative) comparisons across stud- 
ies, it is nevertheless tempting to compare re- 
sults with those of Glickstein et al. (1961). 
Although their data concern on-ramp merging 
behavior, and are presented in terms of con- 
stant-size class intervals (as opposed to the 
constant-frequency class intervals used here), 
the comparison may be of heuristic value. 

The data from Glickstein et al. are sub- 
divided according to merging by moving vehi- 
cles and merging by stopped vehicles; 97% of 
our data were from moving vehicles. Glick- 
stein et al. used only one measure of gap size, 
«size of gap presented in seconds,” which cor- 
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responds to our measure of time headway, 
TH. From the data presented by Glickstein 
et al. in their Table 4, we computed values 
of » for time headway versus acceptance/re- 
jection, and obtained a value of 463 for 


fairly well predicted from this datum alone. 
The driver of a moving vehicle, however, is 
concerned with the relationship between his 
speed and certain time or speed characteristics 
of the gap, so his behavior cannot be well 
on the basis of time headway alone, 

icati of any g index must, of 

Course, await cross-validational studies per- 
formed under a wide variety of conditions, 
If it is possible to obtain such an index (or 
indices) of sufficiently wide generality, we can 
Proceed to compare the degree of stimulus 
control of gap acceptance across differences 
in traffic situations: differences in roadway 
features, traffic control procedures, etc. Sup- 
Pose, for instance, that Situation A yields a 
igh y between g and gap acceptance versus 
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gaps of lower g and accepting gaps of higher 
g. In Situation B, they were less consistent; 
gaps of lower g were accepted relatively more 
often, in comparison with gaps of higher g. 
This would permit the inference that drivers 
were less confused in Situation A. Consistency, 
as implied by », is measurable independently 
of “threshold” shifts, that is, the tendency to 
accept or reject more gaps in general. If it 
is defensible to assume that the driver popu- 
lations which were sampled in the two situa- 
tions are similar, then it can be concluded that 
something about Situation B causes drivers to 
be more confused than in Situation A, and 
proceed to derive and test hypotheses as to 
the source of this confusion. Sometimes the 
source may be obvious. In other instances it 
will be more subtle, but regardless of subtlety 
it should contribute to the understanding and 
control of driver behavior. 
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HEURISTIC SIMULATION OF PSYCHOLOGICAL 
DECISION PROCESSES’ 


ROBERT D SMITH 
Department of Management, Pennsylvania Stale University 


Complex thought processes used by a skilled psychologist dealing with per- 
sonnel selection and placement are analyzed and programmed in s computer 


simulation model, Heuristic methods used to limit the total pomible set of 
decision branches in the model are defined and illustrated. Research results 
indicate a strong relationship between human and machine output with a H% 
level of agreement between simulated psychological inferences and human 
decisions with identical ultimate employment recommendations in 22 of 24 test 
cases. Implications of research findings on psychological instruction, experimen- 
tation, information retrieval, test validation, and general decision making are 


discussed, 


Recent developments in the study of man 
and machine systems have resulted in signifi- 
cant interdisciplinary achievements. The areas 
of psychology, industrial management, and 
operations research have been increasingly in- 
tegrated through quantitative and scientific 
methods which aid in the definition, explana- 
tion, and prediction of human behavior. One 
of the most useful of these methods is simula- 
tion, a research methodology which facilitates 
the design and testing of models of physical 
and human systems, Through the use of 
simulation, the models can be programmed to 
yield results which closely approximate real- 
world phenomena. 

Simulation models are of varying types. 
Some are applied to the study of physical sys- 
tems such as oil refineries and railroad switch- 
ing yards. Others deal with man-machine sys- 
tems such as the Rand Air Defense Simula- 
tion in which military personnel and hard- 
ware exhibit integrated reactions to simulated 
changes in the air environment. Yet another 
type involves human cognition and can be 
used to replicate the manner in which man 
makes decisions in solving complicated prob- 
lems. The purpose of this article is to present 
the methodology, results, and, most important, 
the implications of research dealing with this 
latter form of simulation. 


1 This article is based on a paper presented at The 
College on Management Psychology Session of The 
Institute of Management Science Fourteenth Inter- 
national Meeting, Mexico City, August 26, 1967. 


Considerable work in the development of 
cognitive simulations has been conducted by 
university and privately sponsored research 
groups. Colby (1964) at Stanford, Maruyama 
(1966) at Berkeley, Kleinmuntz (1963) at 
Carnegie-Mellon, Swenson (1962) at the Mayo 
Clinic, and Finney (1967) at The University 
of Kentucky have been instrumental in de- 
signing computerized simulations which de- 
scribe individual psychological characteristics 


of a psychologist involved in the analysis and 
interpretation of test results, job requirements, 
and personal characteristics of potential cleri- 


problems not amenable to solution by algorith- 
mic techniques in which the researcher is 
able to express an objective function and 
constraints in well-defined mathematical terms. 
Instead, it is necessary to rely upon the con- 
struction of simulated decision networks which 
replicate as closely as possible the psycholo- 
gist’s thinking processes. 


METHOD 


As shown in Figure 1, the experimental model ex- 
tended the number and types of variables included 
in previous studies by adding job specifications and 
personal characteristics of applicants. These addi- 
tional variables made the task of simulation more 
difficult since inferences derived from test scores had 
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Fro. 1. Evolution of model complexity. 


to be viewed in light of specific job requirements and 
personal of the individual such as her 
age, experience, and length of time in her present 


position 

Tt was hypothesized that a simulation model could 
be developed which would replicate the thought 
processes of a skilled psychologist as he performed the 
decision task. The Protocol M: was used to 
gather data (ie, the analyst's verbalization of 
thoughts were recorded as he evaluated a series of 
cases dealing with actual job applicants). The ap- 
plicants were all female and were applying for vari- 
ous types of clerical positions requiring varying 
levels of skill. These positions were billing clerk, 
statistical clerk, clerk typist, receptionist, administra. 
tive assistant, and executive secretary. The protocol 
Was tape-recorded, transcribed, and analyzed in depth 
prior to the development of a computer flow chart 
defining the decision processes. Finally the model was 
tested by utilizing a set of cases which were analyzed 


z 


the human and processed by the computer. 


ResuLTS 
In all, 24 cases were evaluated by both hu- 


machine was not programmed to handle the 
input data and consequently the human ana- 
lyst was called in, and, in the other case of 
nonagreement, the human recommended hir- 
ing as a fair risk while the model suggested a 
further background check of the applicant. 
Accuracy of the simulated interpretative 


statements generated by the computer was 
judged to be 94% by a skilled analyst not in- 
volved in the collection of the protocol. Fur- 
ther, statistical tests of hiring recommenda- 
tions showed that the probability of obtaining 
such results by chance was so small that the 
hypothesis was accepted, and it was con- 
cluded that the model did simulate very 
closely the human analyst’s psychological in- 
ferences, 
Limitations 

The preceding paragraphs have very briefly 
presented the methodology and results of the 
research.? It should be noted that no attempt 
was made to improve upon the decision-mak- 
ing process of the analyst and it would have 
been preferable to test a larger sample of 
cases had they been available. It might be 
true that a more valid and reliable model 
would have resulted if the combined decision- 
making capabilities of a group of analysts had 
been used. However, rather than spending 
time in a discussion of the limitations of the 
study, more significant issues involve the use 
of heuristics in development of the model and 
the implications of results on general psycho- 
logical decision making. 


Heuristic Reduction of the Decision Space 


In developing the computer model, the 
matter of relevancy of information became all 
important. Consider the clerical selection bat- 
tery used in this experiment. It is comprised 
of the Otis Mental Ability Test, the Short 
Employment Tests, the Washburne S-A In- 
ventory, and the Gordon Personality Inven- 
tory and Profile. This battery includes 20 
different test scores, some of, which can range 
from 1 to 99 such as the Gordon scores while 
others range from excellent to maladjusted 
(7-point scale) on the Washburne Test. In 
all, there are 1,151 possible psychological test 
Scores, in addition to job and personal data, 
which can be obtained for a single applicant. 
Thus, it is plain to see that the number of 
combinations of relationships and elements 
that can be chosen quickly exceeds manage- 


able proportions. In fact, using the variables 


For a complete discussion of research design refer 
to Smith and Greenlaw (1967). 
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representing a programming task of impos- 
sible magnitude. To avoid this situation, it 
fs necessary to restrict the boundaries of the 

and consider only those relationships 
which are most relevant to the solution of the 
decision problem. 

To simplify development of the experi- 
mental model, the experience of the decision 
maker was organized into a graphic represen- 
tation of thought processes, The 
‘used in the simplification of the model (Le., 
to limit the boundaries of relevance) were 
Actually applied by the psychologist and 
merely transformed into more quantitative 
terms by the researcher. These principles are 
often referred to in the literature as “heuris- 
tics” taken from the Greek heveiskein which 
means to limit search. Heuristics provide gen- 
eral decision rules or “rules of thumb” which Fro. 3. Partial ee. map for inteltiguace 
—4 researchers in the iro sg t pai i 

models of systems which are in 
— hunan and machine capabilities. {fon „ = 
Heuristic solutions to problems are not neces- fact, may not be. Clarkson's (1962) Port- 
sarily the best or optimum solutions since cer- folio Selection and Tonge’s (1961) Assembly 


tic decision map for sociability. 


å Fic. 2. Partial heuris 
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Figures 2 and 5 provide examples of heu- 
ristics as well as the general methodology used 
in mapping the thought processes, Note that, 
in Figure 2, the sociability score has been 
subdivided into three broad ranges al the 
lower end of its scale, and within one of 
these ranges (Le., 6 through 10) it is com- 
pared with certain job specifications. 
not the only place in the program where the 
sociability score is considered, but it is evi- 
dent here that a low score coupled with work 
requiring high levels of verbal proficiency and 


vestigation and seemed to weigh the Otis 

heavier than other factors for most 
jobs. Next, he referred to the clerical aptitude 
results followed by the emotional stability 
scores, and finally the personality profiles. 
This does not mean, however, that he fol- 
Sequence from one test 
to another. If, for example, he found a girl 
who performed remarkably well on the Otis 
Test, he would investigate the Gordon score 
on “original thinking” as well as the type of 


here was to prevent very intelligent girls from 
being placed in positions where their ability 


subsequent interaction among the subsystems, 
but the heuristic technique of fractionating 
the total problem further simplified the work 
and, more importantly, permitted the simula- 
tion to follow a path through the network 
whereby the most heavily weighted variables 
were given highest priority. Items which 
tended to eliminate candidates most often 
were considered almost immediately, and time 
was saved since many unacceptable applicants 
were discovered relatively early in the net- 
work. 


Implications of Research Results 


One of the most important implications of 
the results of this research is the knowledge 
that it is possible to program fairly complex 
psychological decision tasks, This means that 
time normally spent by highly skilled analysts 
in decision making which is fairly routine for 
them can be made available for more signifi- 
cant problem solving and research. Second, 
the probability that boredom might influence 
results may be lessened as the psychological 
evaluator concentrates on those cases of un- 
usual nature which require diverse and flexible 
skills found only in the human analyst. 

With computerized processing of routine 
data comes a marked reduction in the cost per 
interpretation when, of course, there is suf- 
ficient volume to introduce economies of scale. 
Where the Psychologist normally took about 
20 minutes to give a complete analysis, the 
machine performed the same type of task in 
about 3 seconds.“ In this manner, expert 
analyses may be provided to those organiza- 


tions which previously could not afford the 
— sa 

The decision model designed for the experiment 
Was programmed to branch to the next case whenever 
an applicant was rejected. The psychologist, however, 
continued with a complete analysis even though a 
combination of factors led him to formulate a nega- 
five recommendation somewhere during the course of 
his evaluation. 


* 


of skilled psychologists. This may be 
particularly true in personnel offices and 

centers which sometimes offer sub- 
“marginal services in the ficld of psychological 


Simulation models of this type can be useful 


“in the field of computer-assisted instruction. 


A laboratory approach in base psychology 
courses might be established whereby students 
would verify their beginning and Intermediate 


C attempts at analyses against the results of an 


expert. With the information storage and 
retrieval capabilities of today's computers, 
teaching efforts might be made more efficient 
through the use of feedback data. Professors 
could evaluate the most common errors made 
by their students during the laboratory anal- 
yses and thereby concentrate upon these areas 
‘of weakness in subsequent lectures, 


Computerized simulations of psychological 


| decision processes offer other advantages from 
"an information-utilization standpoint. It is 


now possible to study the effects of policy 


changes on end results without disrupting the 


present system. For example, in the personnel 
model discussed in this paper, one could de- 
termine the effect on mental ability standards 
caused by a labor shortage which required a 
business to increase the acceptance ratio by 
20% (le., how much would one have to lower 
mental ability scores to gain a 20% increase 
in the number of applicants accepted). In 
this type of model it would be fairly simple 
to determine the effects of changing the values 
of individual parameters on final decisions as 
well as to determine which factors were re- 
sponsible for the greatest percentage of re- 
jected applicants. If any of these rejection 
factors were amenable to correction 
training, it would be possible to estimate the 
trade offs between the costs of additional per- 
sonnel search versus the costs of training 
persons not quite acceptable under present 
standards. 

From the view of a science of w 8 
ing, this experiment lends some lence 
the theory that human thinking can be defined 
as a complex network of simple binary choices. 
It is difficult to determine whether the ana- 


criteria. But, by sequencing his thought proc- 


StuviaTion oF Decos Processes ym 


particular point 
study of the effects of aging and 
on decision-making capabilities of individeals 
and groups. 


Suggestions for Further Research 


T 
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1115 
[in| 
H 
| 


chological test results and environmental 
tors. The model could then be used to screen 
large numbers of people, some of whom might 


. 
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exhibit symptoms of present or potential 
mental illness. A computer-assisted screening 
center would be used to diagnose cases in 
early stages for referral to a specialist, thus 
providing minimum-cost psychological ser- 
vices to broader segments of our society. 
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EFFECT OF PRICING ON PERCEPTION 
OF PRODUCT QUALITY 


J. DOUGLAS McCONNELL * 


Stanford University 


An experiment was undertaken to test whether the price of a product or brand 
was a determinant of S's perception of the quality of the product. 60 beer- 
drinking adult Ss made 24 selections from 3 brands of beer over a period of 
8 wk. Ss were free to choose whichever brand they wished. Brand names 
were consonant letters of the alphabet, and the product was identical for all 
3 brands. The only real variable was price. Analyses indicated that S's percep- 
tion of the quality of the beer was significantly affected by the cue supplied, 


i. e., price. 


The theory that price differentials will re- 
sult in consumers perceiving product-quality 
differences, where products are in fact homo- 
geneous, has been suggested by Scitovsky 
(1944-45), Leavitt (1954), and Tull, Boring, 
and Gonsior (1964). The experiments by 
Leavitt and Tull, however, were pencil-and- 
paper tests where Ss did not use the brands 
of products presented in the choice situation. 
Their findings, therefore, must be treated 
with reservations. The specific hypothesis 
tested in the study reported here was that, in 
the absence of more directly observable cues, 
consumers accept price as a product charac- 
teristic correlating highly with product qual- 
ity and will alter their perception of a product 
on the basis of the cues supplied. In other 
words, it was believed that Ss would perceive 
the higher priced brands as being of higher 
quality even though the actual product was 
identical for all brands. 

Observation of consumer behavior in the 
market place provides direct support for this 
hypothesis. A $5,000 automobile is perceived 
as a better quality car than one costing 
$3,500; a $3.00 pair of stockings is perceived 
as being superior in quality to an $.85 pair, 
yet both may “run” within 5 min. of being 
worn, An $80 gown is superior to a $45 gown. 
The phrase “you get what you pay for” is 
accepted as a truism in the American culture 
and those of most highly industrialized na- 
tions. 

1 Now at Stanford Research Institute, Menlo Park, 
California, The article is based on research under- 


taken as part of the author's unpublished doctoral 
dissertation, Stanford University, May 1967. 


In view of this, the hypothesis appeared to 
be soundly based. It was believed that if a 
“price-quality” cognition existed the Ss, being 
unable to change the “price-brand” stimulus, 
would change their perception of the product 
to maintain consonance between their cogni- 
tions and perception. 


METHOD 
Subjects 


Sixty adult married students, or wives of students, 
who classified themselves as beer drinkers, were 
chosen at random from Stanford University Married 
Student Housing and served as Ss. Median age was 
27 yr. with a range of 21-55 yr. Family income 
ranged from $2,500 to $15,000, with a median value 
of $5,000. Forty-four Ss were male and 16 female. 


Product 


The product used in the experiment was beer 
which was obtained from a California brewer with 
the approval of the California Department of Al- 
coholic Beverage Control and the United States In- 
ternal Revenue Service. 

‘An order was placed with one brewer for 60 cases 
of lager-type beer, bottled in unmarked, nonreturn- 
able “glass cans,” and sealed with a plain seal. It was 
specified that all beer come from the same batch 
number so that the product would be physically as 
homogeneous as possible. 

It was decided to restrict Ss’ range of choice in the 
experiment to three brands of beer. To identify these 
three brands, cases of beer were drawn from in- 
ventory as they were required and labeled at random 
with white rectangular self-adhesive labels lettered 
L, M, or P in red-felt marker ink. The letters, fol- 
lowing Tucker (1964), were selected for ease of 
jdentification and memory and because each is a 
consonant falling in the middle of the alphabet and 
occurs with approximately the same frequency in 
usage of the English language. Nonsense syllables, 
such as suggested by Davidson, Suppes, and Siegel 
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TABLE 1 
Dara rox CONTINGENCY TABLE ANALYSIS 


Price levels of beer 


Ratings from 
Likert-type | |_—————____ 
scak High | Medium | Lom 

Undrinkable 0 4 1 4 
Poor 1 8 21 20 
Fair 2 26 22 23 
Good 5 15 12 9 
Very pleasant 4 7 4 4 

Total score 133 117 109 

Mean score 2.23 1.93 1.80 


(1957), were considered but rejected on the grounds 
that if they were “association free“ they would be 
difficult to learn—an undesirable attribute for any 
brand name. It was decided to risk the possibility 
that the letters chosen might have meanings for 
some Ss to facilitate learning of the brand names. 
The price was shown to the customer in two ways. 
One, a card behind each test bottle told the price of 
that brand per six-pack since most consumers see it 
in the stores that way. As Ss were not paying for the 
bottle of beer they selected at each trial, the situa- 
tion was considered to be very artificial and not com- 
parable to the purchase decision situation that 
usually confronts a shopper. To minimize the effect 
of these free or “windfall” goods in the selection 
Process, the less expensive brands had to represent 
a financial saving compared with the high priced 
brand, equivalent to the six-pack price differentials, 
This was achieved by taping the per bottle price 
differential of 5¢ on the low priced brand and 2¢ on 
the medium priced brand. Thus, an S selecting the 
low priced brand would be in a position similar to a 


consumer who purchases a low priced brand and has 
more change in his pocket than he would have if he 
bought a high priced brand. 


Procedure 


The Ss were called on in their own apartments at 
Prearranged times. A number were called on be- 
tween 8:30 At. and 9:15 A.M., while the greater 
majority were visited between 5:00 P.M. and 7:30 
P.M. 

On each visit Ss selected one bottle from the three 
brands (L, M, and P) which were set out on a 
wooden tray and backed by a card stating the price 
per six-pack. For each trial the Positions on the 
tray were changed so that any selections based on 
position on the tray would be completely randomized, 
No brand occupied the same Position on the tray 
for any two successive trials, and each brand oc- 
cupied each position with equal frequency over the 
24 trials. The 24 trials were conducted at the rate of 
three per week for 8 wk. The Ss undertook to drink 
each bottle selected at any time convenient to them 
before making their next selection. After their final 
selection in Trial 24, Ss were presented with a ques- 
tionnaire form to be completed. The Ss were asked 
on this form to rate each of the three brands of beer 
on a 5-point Likert-type scale, They were also asked 
to select from a panel of words, commonly used to 
describe beer, the three words that best described 
each brand. The words in the panel are shown in 
Table 3 and have generally favorable and unfavor- 
able connotations with respect to beer. 


RESULTS 


The data were analyzed in three ways. A 
simple contingency table analysis (Mood & 
Graybill, 1963) was made to test the relation- 
ship between the ratings obtained and the 
three price levels of the brands of beer. The 


Summary oF RATING SCORES By BRAND AND VALUES rok T Ustne Metuop 


FOR CORRELATED OBSERVATIONS 
Brand M Brand L Brand P 
Numerical ($1.30) ($1.20) ($0.99) 
value per 
rating point 
No. responses | Score value No. responses | Score value No. responses | Score value 
; 4 0 1 0 4 0 
8 21 21 20 20 
2 26 12 22 44 23 46 
K 15 45 12 36 9 27 
4 7 28 4 16 4 16 
Total score 133 11 
Mean score 223 165 ae 


Note.—According to the Stu 


dent's ¢, mean dil 
rand L = 1.64, p < .06; Brand L versus Brand p cenges between Brand M versus Brand P = 2.46, p < .01; Brand M versus 
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TABLE 3 


Paner or Worps Usep BY Sers TO 
DESCRIBE THE THREE Branns oF BEER 


Favorable Unfavorable 
(implied high quality) (implied low quality) 

Tangy Flat 

Rich flavored Biting 
Smooth Acidy 

Full bodied Watery 

Light Bitter 

Dry Sour 


data for this analysis form Table 1. The rat- 
ings of “undrinkable,” “poor,” “fair,” “good,” 
and “very pleasant” were assumed to be 
monotonically increasing and were given 
values of 0, 1, 2, 3, and 4. 

The contingency table analysis produced 
highly significant findings, the x? distribution 
value being significant at $ < 005 level, 
thereby indicating the existence of a price- 
quality relationship. 

The second test took the mean ratings ob- 
tained for each of the three price levels and 
tested each pair of means for significant dif- 
ferences, using the Student's! distribution and 
method for correlated observations (Winer, 
1962). This method was used because the 
ratings for each brand could not be assumed 
to be independent as each S rated each brand. 
The results of this analysis are given in Table 
2. A significant quality difference was per- 


TABLE 4 


Basis OF DETERMINING NUMERICAL ScoRE FROM 
Worps Cuosen BY SUBJECTS TO 
DESCRIBE BRANDS 


No. responses 
Numerical 
score 


Unfavorable 
words 


Favorable 
words 


TABLE 5 


Summaxy or Anacysrs or Vasiance roe FAVORABLE 
ann Unvavoraste Worps Arrainutes TO 
BrANDS 


3 | SS | a | us | F 


Total | 1800.20 | 179 | | 
Between brands | | 

M, L, P 64.53 | 2 | 32.265 | 3.29° 
Within | 1735.65 | 177 | 9.801 


> = .0S, F = 3,07, df = 2/120. 


ceived between the high and low priced brands. 
The interesting aspect of this analysis, how- 
ever, was that the middle priced brand was 
not perceived as being of significantly higher 
quality than the low priced brand. The “per- 
ceptual distance” between the high and middle 
priced brands was greater than between the 
middle and low priced brands, although the 
actual price differential would have led one 
to suspect that the reverse should have been 
the case. 

The third test of the price-quality relation- 
ship used Ss’ selections from the panel of 
words with favorable and unfavorable con- 
notations listed in Table 3. 

To examine Ss’ selections statistically, a 
numerical score was assigned to each S’s se- 
lection on the basis of the number of favorable 
and unfavorable words chosen for each brand. 
This is shown in Table 4. 

This test permitted Ss to be regarded as 
“blocks,” and for price to be regarded as 
“treatments,” and a one-way analysis of 
variance (Winer, 1962) to be carried out. 
The results were significant at the p < .05 
level as indicated in Table 5. 

Apart from this analysis of Ss’ word choices, 
the overall direction of selections is readily ap- 


TABLE 6 


RELATIONSHIP BETWEEN PRICE OF BEER 
AND DESCRIPTIVE WORDS 


0 2282 
S UDN®Æ 


—— 


Brand of beer and price per six-pack 


Type of 
r ee 
chosen 

M ($1.30) | L ($1.20) | P ($0.99) 
T ae ae ae 
Favorable 93 73 57 
Unfavorable 71 82 101 


Note.—Totals chosen are number of words. 
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FAVORAKLE WOOS 
3 a Oe 
8 
3 
z 
t M 
LOW PRICE MEDIUM PRICE HIGH PRICE 


BRANDS OF BEER 


Fic. 1. Plotting of favorable and unfavorable descrip- 
word responses by brand and price. 


parent from a cursory examination of Table 
6 and Figure 1. 


Discusston 


Price was used by Ss as a cue to product 
quality. With a physically homogeneous prod- 
uct and unknown brand names (which had so 
little meaning that many Ss never used them), 


better than the lowest priced brand. In fact, 
if perception and reality had correlated per- 
fectly, each S would have tried each brand 
once and then chosen Brand P (the lowest in 


Brand M (high price) for over 41% of all 
trials and Brand L (middle price) for 25% 
of all trials. Price, in the absence of other 
cues, was a powerful determinant of how the 


Perceived quality did not vary linearly with 
price. The greater price disparity existed be- 
tween Brand L and Brand P, a total difference 
of 3¢ a bottle. The price difference between 
Brand M and Brand L was only 2¢ a bottle, 
However, the perception of quality was such 
that Brand L and Brand P were perceived as 
much closer than Brand L and Brand M. 
This study found no causal explanation for 
this phenomenon. A first-order Markov analy- 
sis of brand switching revealed that Brand L 
lost to both Brands M and P with the greater 
movement to Brand M. This analysis tended 
to confirm the significance of the ratings ob- 
tained for quality. 

The study strongly supports the earlier 
work that price is an independent variable on 
which perception of a brand or product de- 
pends. The extent to which this may be gen- 
eralized beyond the United States culture and 
over a wide range of products is unknown. 
Further research is clearly indicated. 
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COMPARISON OF QUESTIONNAIRE RESPONSES WITH 
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This study reports the results of a field experiment carried out in an inchustrial 
organization designed to test the impact on fixed-alternative 

answers of having some respondents be interviewed by a series of open-ended 
questions covering the same topics prior to taking the questionnaire. Support 
was provided for 1 of the 2 hypotheses tested. More negative data were pro- 


vided about peer relations by those who were in 


before doing the 


questionnaire than by those who were not. This finding held for both low- 


and high-skill employees. There was n 
conditions, 
context of other research in this area, 


under the 2 methodological 


o difference in the data about superiors 


The results were interpreted in the 
and 2 continua for explaining field 


method effects on questionnaire validity were proposed. 


The place of the interview in social science 
research has been a subject of some con- 
troversy. Some investigators have cautioned 
against the use of the technique because of the 
possibility that an interviewer might be able 
to lead a respondent to bias his answers to- 
ward those the researcher wanted (Dunnette, 
1962). Others have criticized the interview on 
the ground that it provides data only on a 
respondent’s perception of events (Becker & 
Greer, 1957). Nevertheless, a fairly extensive 
literature has developed around styles and 
types of interview (Kahn & Cannell, 1957; 
Richardson, Dohrenwend, & Klein, 1965; 
Whyte, 1960). Another methodological de- 
bate and inquiry has concerned the use and 
misuse of questionnaires. Public criticism has 
been voiced against “survey sickness,” the 
agonizing pursuit of questionnaire data to 
answer trivial questions in the misguided 
quest of science (Hechinger, 1966). There 
have also been a number of studies directed 
toward empirically determining some of the 
conditions which facilitate obtaining valid 
data by the use of questionnaires. 

One variable studied has been the effect of 
anonymity of respondents. Although a num- 
ber of studies have been reported, the results 
are not entirely consistent. Pelz (1959), for 
example, compared the questionnaire responses 
of Ss who were completely anonymous to 

1The writer would like to thank Richard A. 
Dunnington and Chris Argyris, Fritz Steele, and Wil- 
liam F. Whyte for their comments on an earlier ver- 
sion of this paper. 


those who were identified but confidential. 
The basic conclusion was that there were no 
differences between the two methods of ad- 
ministering questionnaires in terms of the at- 
titude scores obtained. The investigator, how- 
ever, cautioned that these results would prob- 
ably have been different if the study were 
done under different conditions, for example, 
if careful preparation had not been carried 
out, if there were fear of layoff, or if there 
were widespread hostility toward manage- 
ment. Klein, Maher, and Dunnington (1967), 
on the other hand, found that questionnaire 
responses were more positive for identified 
than for anonymous respondents. The answers 
were also more positive when a plant man- 
ager told the identified Ss to use an identify- 
ing code than when a member of the corpora- 
tion personnel research group made the same 
request. Specific questions also had a greater 
tendency to be positively distorted, namely, 
ones about pay and evaluation by one’s 
superior. 

The Klein, Maher, and Dunnington study 
was carried out entirely by company staff. 
Dunnette and Heneman (1956) compared the 
responses of employees to whom a question- 
naire was administered by their personnel 
manager with those to whom the same ques- 
tionnaire was administered by an independent 
university researcher. They found significant 
positive distortion on general morale items 
and on supervisory-subordinate issues by the 
respondents to whom the questionnaire was 
administered by the insider. 


335 
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Still another way to look at these methodo- 
logical issues is to ask about the effects of 
simultaneously using more than one method 
to investigate a problem area. For example, 
do respondents answer a fixed · alternative 
questionnaire differently if they have been 
interviewed prior to taking the questionnaire? 
The present study reports a field experiment 
designed to answer that question. 


Theoretical Issues 


One point of view about the research process 
suggests that it contains elements of mutual- 
ity (Argyris, in press; Maslow, 1966). If a 
researcher wants valid data, it helps if he is 
perceived as a trustworthy person. If he wants 
complete data, it helps if he makes himself 
accessible to observation, too. Of course, this 
is not the only way to conceive of the re- 
search process. One might also suggest that 
respondents do not want to know the re- 
searcher and prefer that he remain distant, 
uninvolved, and presumably neutral. How 
might one test between these two points of 
view using interview and questionnaire meth- 
odology? 

Suppose the researcher conducted an open- 
ended interview with each respondent before 
asking him to complete a questionnaire. The 
interview might serve a number of functions: 
(a) the respondent would become familiar 
with the topics of the study, (b) he would be 
able to answer the questions in his own terms 
rather than be forced into fixed-alternative 
answers on the questionnaire, (c) he would 
be able to ask questions of the researcher 
without being observed by other members of 
the organization, and (d) he would have con- 
siderable time to “size up” the investigator 
and to find ways to test his trustworthiness. 

If an interview provided a respondent with 
a mechanism whereby he could trust the re- 
searcher more, how would this affect question- 
naire responses? The impact of trust generally 
is to provide a climate of psychological safety, 
thereby allowing a person to talk more freely 
about potentially threatening issues. Some 
previous research provided a basis for a priori 
predictions about which research topics would 
be more likely to be distorted if adequate re- 
searcher-respondent trust were not present. 
Argyris (1962) has found that interpersonal 
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issues, such as dealing with emotions and 
facing conflict, tend to be difficult for many 
organization members in their day-to-day ac- 
tivities. If these issues present problems for 
people in their regular dealings with each 
other, then one would suggest that they would 
also be quite threatening in single encounters 
with an outsider whose trustworthiness might 
be open to question. In addition to the gen- 
eral threat posed by interpersonal issues, 
there is also the fact that some of the major 
issues in the client-researcher relationship are 
questions of trust, confidence, and respect. 
So one would expect the research encounter 
itself to evoke concerns about interpersonal 
issues. Consequently, the attitudes which 
would be expected to be more fully recounted 
if additional trust could be developed in the 
researcher-client relationship would be those 
dealing with interpersonal relationships. If 
the attitudes are more fully described, their 
overall sentiment would be more negative 
because the factors people would withhold 
due to lack of trust would be the negative 
aspects. The following hypotheses, therefore, 
were subject to test in a field experiment. 

Hypothesis 1: Respondents who were in- 
terviewed by a series of open-ended questions 
covering the same subjects as those of the 
questionnaire would give more negative ac- 
counts of satisfaction with respect from supe- 
riors than respondents who were not inter- 
viewed before taking the questionnaire. 

Hypothesis 2; Respondents who were in- 
terviewed by a series of open-ended questions 
covering the same subjects as those of the 
questionnaire would give more negative ac- 
counts of satisfaction with respect from peers 
than respondents who were not interviewed 
before taking the questionnaire. 


METHOD 
Procedure 


The study was carried out over a period of 3 mo. 
in a manufacturing organization employing approxi- 
mately 1,700 people. Three shifts were in opera- 
tion, and data were gathered on each of the shifts. 
A week preceding the time when an S was asked to 
appear to participate a notice was sent to him 
through his first-line supervisor. The memorandum 
indicated the time and place for the person to ap- 
pear and indicated whether he would be inter- 
viewed or asked to complete a questionnaire. 


* 
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| 
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When a person came for an interview be was 
greeted by the investigator and informed about the 
purposes and topics of the study. While this explana- 
tion proceeded, a small tape recorder was setting on 


be clearly seen that it was not turned on. When the 
jnitial introduction had been completed, the inter- 
viewer asked for the respondent's permission to 
the interview. The person was assured that no 
in the company would ever be allowed to listen 
to the tapes. 

All of the interviews were conducted by the 
writer, Almost without exception the questions were 
asked in the same order. Some sample questions 
were, “What do you like to get out of a job?” or 
“Would you describe your boss for me?” There were 
15 standard questions. Additional probing questions 
were used to clarify what a respondent meant when 
it seemed unclear to the investigator,* 

Questionnaires, for people not interviewed, were 
distributed during meetings attended by 10-20 per- 
sons at one time. The meetings were held between 
shifts and were attended by people who were leaving 
after having completed their work for the day. No 
more an three people from a given department 
were ed to attend a questionnaire administration 
at one time. No more than two people from a given 
shift in a given department were in in 1 day. 

Persons who were interviewed were given a ques- 
tionnaire and a stamped envelope addressed to the 
investigator after the interview had been completed. 
The questionnaire had a code number on the back. 
The S was told that this number existed so the re- 
searcher could keep all the information provided by 
each person together. The confidentiality of the study 
was reiterated at this point. 

Persons who attended questionnaire meetings 
either completed the instruments in the meetings or, 
if there was not sufficient time, also received a 
stamped envelope addressed to the researcher. Ques- 
tionnaires which were completed in the meetings had 
no identifying labels; neither names nor code num- 
bers were provided. Where the questionnaires were to 
be mailed back, then identifying numbers were used. 
The Ss were informed that the investigator wanted 
to be able to remind them in case they forgot to send 
back their questionnaires. Confidentiality was reaf- 
firmed. Approximately 60% of the questionnaires 
were completed in the meetings. 


Subjects 


The respondents in this study were all nonsuper- 
visory employees. Two groups of Ss are distinguished 
according to skill level. People holding the low-skill 
jobs had duties which included: custodial work, 
highly repetitive assembly-line operations, hi 

tending in continuous process operations, inspection, 
and stock accounting. The high-skill group included: 
the trade groups such as mechanics, „and 


2 A complete account of the interview questions 
and introductory procedures is given elsewhere (Al- 
derfer, 1966). 


veloped new Inspection standards asd worked sway 
from the production lines; and advanced muchine 
operators who tended highly complex machines and 
carried out selected stat functions as well, 

The Ss were randomly selected from the lint of 
employees in the organization. Approsimately 90% 
of the people who were selected partkipated is the 
study. Roughly one-third of each skill group were 
randomly assigned to be interviewed before taking 

questionnaire. Forty-four low-shill and J9 bigd- 


i 


Dependent Variables 
The data used to test the experimental hypotheses 
the questionnaires. 


Poy satisfaction: (7 items). 

Compared to the rates for similar work here, my 
pay is good. (agree) 

I find it hard to do some of the things I enjoy 
most on the pay I receive. (disagree) 

Respect by superiors satisfaction: (8 items). 

My boss takes account of my wishes and desires. 
(agree) 

My boss will play one person against another, 
(disagree) 

Respect by peers satisfaction: (8 items). 

1 can count on my co-workers to give me a hand 
when I need it. (agree) 

My co-workers are uncooperative unless it's to 
their advantage. (disagree) 

Use skills and abilities satisfaction: (12 items). 

My job requires that a person use a wide range of 
abilities. (agree) 

In my job I have same thing to do over and over. 
(disagree) 

RESULTS 


Table 1 presents the means and standard 
deviations for the two skill groups under the 
different experimental conditions on each of 
the four attitude scales. The data do not pro- 
vide support for Hypothesis 1, for either skill 
group. There are no statistically significant 
differences on the questionnaire measure of 
respect by superiors satisfaction between the 
two experimental conditions in either skill 
group. 

There is support for Hypothesis 2, how- 
ever. The low-skill employees who were in- 
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TABLE 1 


QUESTIONNAIRE MEASURES OF NEED SATISFACTION FOR 
QUESTIONNAINE-ONLY AND INTERVIEW- 
AND-QUESTIONNAIRE GROUPS AT 
Two Sent Levers 


Group | N | * sD z 
Pay satisfaction | | 
Low-skill questionnaire only ss | —03 | 1.01 
Low-skill interview-and -65 
Questionnaire sf 146% 13 
High-skill q aeationaasire-only 7s 12] 7 Li 
High-skill interview-and. 72 
Questionnaire 39| 25| 9 | 


Respect by superiors satisfaction 
Low-skill questionnaire-only ss 2s 19 


Low-skill interview-and- 72 
Questionnaire “u 35| .90 
High-skill Questionnaire-only 7S | —.17 | 1.20 
High-skill interview-and- —.70 
jonnaire 39 | -31 | 1.10 
Respect peers satisfaction 
Low-ski] questionnaire-only 88 «10 f 1.05 
Low-skill Interview-and- —1.95* 
Questionnaire 1 31.28 


High-skill — 78 -23| 74 


High-skill interview-and- 2.74% 

Questionnaire 39 | —.29 | 1.06 
Une skills and abilities satisfaction 

Low-skill questionnaire-only 88 | —.43 | 1.08 

Low-skill interview-and- ~ti 
Questionnaire 44) -as 93 

High-skill questionnaire-only 75 20 .80 

High-skill interview-and. —.50 


questionnaire 39 12] .87 


Note.—The attitude scale scores were standardized with a 
mean of 0 and standard deviation of 1 for the whole organiza- 
tion. Negative signs on the means refer to the dissati ‘action 
end of the scale; positive signs to the satisfaction end. 

* p < 026, one-tailed test, 

* Ss, one-tailed test. 


terviewed prior to completing a questionnaire 
gave significantly less satisfied accounts of 
Tespect by peers than the low-skill employees 
who were not interviewed. A similar finding 
holds for the high-skill group. The high-skill 
Tespondents who were interviewed prior to 
taking a questionnaire showed significantly 
more negative respect by peers satisfaction 
attitudes than the high-skilled employees who 
were not interviewed. 

Data on the other two attitude scales, pay 
satisfaction and use skills and abilities satis- 
faction, show no significant differences for 
either job group as a function of the experi- 
mental conditions, 

Perhaps it should be mentioned that the 
four dependent variables are essentially uncor- 
related with each other. Their median cor- 
relation is zero; the extreme values are .02 
and —.04. Therefore, tests on the separate 
scales are meaningful, and positive or negative 


findings on one scale are essentially indepen- 
dent of results on other scales. 

Also, the reliabilities of the four scales are 
reasonably high. Spearman-Brown estimates 
for the pay, superiors, peers, and abilities 
scales are, respectively, 88, .84, .79, and .88. 
Therefore, it is unlikely that failure to detect 
differences between experimental conditions 
can be attributed to lack of reliability in the 
dependent measures. 


Discussion 


The experimental findings provide support 
for only one of the hypotheses. This support, 
however, was reasonably strong in that it held 
for both the low- and high-skill groups. Why 
was there not support for the respect by 
superiors satisfaction variable as well as for 
the respect by peers satisfaction measure? 
Two explanations are suggested. 

The first implies that the experimental con- 
ditions were not strong enough. That is, while 
having an interview prior to doing a question- 
naire raised the respondents? trust enough so 
they revealed more negative attitudes about 
their co-workers, the procedure was not enough 
to have a similar effect on attitudes about 
superior-subordinate relations, This explana- 
tion assumes that discussion of superior-sub- 
ordinate relations is more threatening than 
discussion of peer relationships and therefore 
requires a higher level of researcher- client 
confidence for the respondents to feel free to 
admit to negative attitudes about their bosses. 

The second explanation says that as far 
as superior-subordinate relationships were con- 
cerned, the additional trust made no differ- 
ence because S was not very threatened. This 
is to imply that the respondents said all there 
was to say on the questionnaires, and the in- 
crement of trust which interviews might have 
added had no payoff in more valid data about 
superior-subordinate relations. Following the 
Dunnette and Heneman (1956) findings, one 
might reason that the investigator being an 
outsider may have been all that was necessary 
to obtain complete data on this topic. In 
terms of the data of this study, this explana- 
tion assumes that discussion of superior- 
subordinate relations was less threatening than 
discussion of peer relationships. 
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If the second explanation is valid, then com- 
bining the data from this study with those 
obtained by Dunnette and Heneman (1956) 
and by Klein, Maher, and Dunnington (1967) 
suggests two generalizations about the impact 
of field-research methodology on the validity 
of questionnaire responses. 

It should be emphasized, however, that 
these generalizations are intended to be quite 
tentative. They represent one way of bring- 
ing some order out of the results of the 
present and related studies on this subject. 
Since they are not empirically confirmed, they 


‘may best be seen as hypotheses suggestive 
of profitable directions for future research. 


The first generalization concerns a con- 
tinuum of research methods ranging from 
those which seem to lead to more response 
distortion to those which lead to less bias on 
questionnaires. The following order from most 
to least distortion is suggested: 


(a) Identified questionnaires with admin- 
istration in part by line management; 

(5) Unidentified questionnaires with ad- 
ministration in part by line management; 

(c) Identified questionnaires with adminis- 
tration by internal (personnel) staff; 

(d) Unidentified questionnaires with ad- 
ministration by internal (personnel) staff; 

(e) Identified questionnaires with admin- 
istration by external (university or research 
institute) researchers; 

(f) Unidentified questionnaires with ad- 
ministration by external (university or re- 
search institute) researchers; and 

(g) Respondents interviewed by external 
researcher prior to filling out questionnaires. 


The reader will note that although this 
order is presented as a single dimension, more 
than one variable enters the definition of 
the points, For example, who the adminis- 
trator is receives a heavier weight than 
whether the respondent is identified. Such an 
assumption involves going a bit beyond what 
the results of the studies cited above and the 
present results strictly show, but the exten- 
sion does not seem unreasonable. Furthermore, 
there is no way of knowing what effect in- 
terviewing preceding the questionnaire ad- 
ministration would have if the interviewer 
were an internal researcher. 


The second tentative pattern emerging from 
combining this study results with those of 
other writers is an ordering of Lawes whose 
validity increases as one moves along the 
methodological continuum. As the method 
provides more respondent-researcher trust, the 
issues seem to have more valid data provided 
about them in the following order: 


(a) Pay satisfaction; 

(b) Superior-subordinate relations, particu- 
larly around subjects of mutual trust and 
equity; and 

(c) Peer relations, concerning such issues 
as acceptance, respect, and trust. 


It would seem that there is a basic dimen- 
sion underlying the ordering of these issues. 
As one moves from pay, through superiors, 
to peers, the object of interest changes from 
a material factor external to an important 
person who (at least formally) is somewhat 
distant from the respondent to co-workers 
who, quite likely, are closer to the respondent, 
The underlying continuum appears to be 
intimacy. It seems that negative data about 
more intimate issues by questionnaire become 
more available as a safer and closer relation- 
ship is developed between researcher and te- 
spondent. 

When satisfaction with pay and satisfaction 
with use of skills and abilities did not show 
differences between the experimental condi- 
tions, indirect support was provided for the 
idea that the preceding interview affects pri- 
marily questionnaire responses about inter- 
personal relationships. From these data, there 
is little reason to suggest that an external re- 
searcher should invest the time and money re- 
quired to interview people if he does not want 
to obtain data about interpersonal relation- 
ships in the organization. On the other hand, 
where data about interpersonal relationships 
are desired, there is reason to believe that 
preceding questionnaire administration with a 
semistructured interview can reduce distor- 
tion of questionnaire responses. 


REFERENCES 


ALDERFER, C. P. Convergent and discriminant valida- 
tion of satisfaction and desire measures by inter- 
view and questionnaire. Journal of Applied Psy- 
chology, 1967, 51, 509-520. 


340 


At, C, P. Differential importance of human 
needs as a function of satisfactions obtained in the 
organization. Unpublished doctoral dissertation, 
Yale University, 1966. 

A, C. Some unintended consequences of rigor- 
ous research. Psychological Bulletin, 1968, in press. 

Axovnis, C. Interpersonal competence and organiza- 
tional effectiveness. Homewood, Jll.: Dorsey Press, 
1962. 


Becxes, H. S. & Grzze, B. Participant observation 
and interviewing: A comparison. Human Organiza- 
tion, 1957, 16, 28-32. 

Duxxxrrx, M. D, Personnel management. 
Review of Psychology, 1962, 13, 285-314, 

Duxxxrxrx, M. D, & Hxxxztax, H. G. Influence of 
scale on employee attitude responses. 
Journal of Applied Psychology, 1956, 40, 73-77. 

Hecurxcer, F. M. A survey of surveys, New York 
Times, September 11, 1966, p. 7E. 


CravtoN P. ALDERFER 


Raus, R. L, 4 Canwert, C. F. The dynamics of 
interviewing. New York: Wiley, 1957. 
Krew, S. M., Maner, J. R., & Duxxixcrox, R. A. 


jects in responding to an industrial opinion survey, 
Journal of Applied Psychology, 1967, 51, 152-160 
Mastow, A. H. The psychology 


Paxz, D. C. The influence of anonymity on expressed 
attitudes. Human Organization, 1959, 18, 83-91 

Ricuarosoy, S. A, Donrenwenn, B. S, & Krux. 
D. Interviewing: Its forms and functions. New 
York: Basic Books, 1965. 

Wuyte, W. F. Interviewing in field research. In R. 
N. Adams & J. J. Preiss (Eds.), Human organiza- 
tion research. Homewood, Ill.: Dorsey Press, 1960. 
Pp. 352-374, 


(Received September 20, 1967) 


een. 


ADDENDUM TO “AN EMPIRICAL INVESTIGATION 
OF TWO IMPLICATIONS OF THE TWO-FACTOR 
THEORY OF JOB SATISFACTION” 


GEORGE B. GRAEN axo CHARLES L. HULIN 
University oj Illinois, Urbana 


Multiple hypotheses were tested between the 2-factor theory of Herberg, 
Mausner, and Snyderman and the traditional theory of job satisfaction. A 
procedure was used that granted the assumption of the 2-factor theory of a 
dimensional difference between overall satisfaction and dissatisfaction. lastre- 
ments employed were designed to capitalize on any unique variance astocisted 
with the satisfaction or the dissatisfaction dimension. Analysis consisted of 
1-way analyses of variance of the Job Description Index variables of Work 
Itself, Promotion, Salary, Supervision, and Co-workers on overall satisfaction 
and on overall dissatisfaction. Results support the hypotheses of the traditional 
theory at the expense of the 2-factor theory. 


Attempts to test the two-factor theory of 
job satisfaction (Herzberg, 1966; Herzberg, 
Mausner, & Snyderman, 1959) employing 
methods other than storytelling, in addition to 
failing to support hypotheses derived from the 
theory, have assumed a single continuum of 
overall job satisfaction-dissatisfaction. The 
single exception was a recent study by Hulin 
and Smith (1967) in which special instru- 
ments were designed to measure independently 
satisfaction and dissatisfaction. The purpose 
of the present study was to examine these 
unique data from the perspective of strong 
inference (Platt, 1964). The strong-inference 
approach pits hypotheses from competing 
theories against one another and demands a 
determination of their relative value for future 
research. It differs from the more usual ap- 
proach by emphasizing hypotheses testing 
rather than theory confirmation (Dunnette, 
1966). 

Hypotheses derived from the two-factor and 
the traditional theory (Ewen, Smith, Hulin, 
& Locke, 1966; Graen, 1966) and tested in 
this study are as follows. The two-factor the- 
ory hypothesizes that those variables classified 
as “satisfiers” (Work Itself and Promotion) 
should control a significant amount of vari- 
ance in overall satisfaction and not in overall 
dissatisfaction, whereas those variables clas- 
Sified as “dissatisfiers” (Co-workers, Super- 
vision, and Salary) should control a significant 
amount of variance in overall dissatisfaction 
and not in overall satisfaction. In contrast, 


traditional theory hypothesizes that both so- 
called “satisfiers” and “dissatisfers” should 
control a significant amount of variance in 
overall satisfaction and overall dissatisfaction. 


Metnop 
The Ss were male and female home-ollice em- 


Itself, Promotion, Co-workers, Supervision, 
ary. In this study, 146 Ss completed the overall satis- 
faction measure, and another 155 completed the 
overall dissatisfaction measure. For further details, 
see Hulin and Smith (1967). 

consisted of performing one-way 
analyses of variance to test the effects of the five 


and overall dissatisfaction. 
RESULTS 


Tose variables classified as satisfiers (Work 
Itself and Promotion) showed significant re- 
lationships on satisfaction and on dissatisfac- 
tion. In contrast, those variables classified 
as dissatisfiers (Co-workers, Supervision, and 
Salary) showed significant relationships on 
satisfaction and not on dissatisfaction, The 
percentage of variance in satisfaction ac- 
counted for by the job variables was 239% 
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TABLE ! 


ANALYSIS OF Value AND OMEGA-SOUARE OF Jon 
VARIABLES ON Sarr ACTION AND DISSATISFACTION 


= = 
Dependent variable 
a ?“2D)ſ 
Independent } . | Dissatisfaction 
variable | Satisfaction only | only 
Pger K 
y| r aja) e % 
Satisier | 
Work itself 2 | 22.28° 06 
Promotion 2 | 14.05* 0 
Dissatisfier 
Co-workers 0¹ 
Supervision 01 
Salary O1 


* % indicates the Proportion of total variance accounted for 
. the, independent variable. 
es 01. 


for Work Itself, 16% for Promotion, 14% for 
Co-workers, 11% for Supervision, and 6% for 

- The percentage of variance in dis- 
satisfaction accounted for by the job variables 
was 6% for Work Itself, 9% for Promotion, 
and a maximum of 1% each for Co-workers, 
Supervision, and Salary. 


Discusston 


capitalize on any unique variance associated 
with each dimension are employed, the results 
clearly discon, 


vision, and Salary) contribute to satisfaction 
and not dissatisfaction are incompatible with 
the two-factor theory. 

e Present results are more compatible 
with the traditional theory. The traditional 


accounts for the differences on overall satisfac. 
tion-dissatisfaction between satisfiers and dis. 
satisfiers by hypothesizing that intrinsic vari- 
ables (satisfiers) are related more strongly 
to overall satisfaction-dissatisfaction than are 
extrinsic variables (dissatisfiers). The tradi- 
tional theory views the differences between 
satisfiers and dissatisfiers as Primarily a mat- 
ter of potency rather than a matter of direc- 
tionality. The only finding not Supporting the 
traditional theory was the failure of the dis- 
satisfier variables to contribute to dissatisfac- 
tion. This lack of relationship was probably 
due to the severe restriction of Tange in the 
measure of overall dissatisfaction. In future 
studies with samples representing lower levels 
of dissatisfaction, higher relationships would 
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A PSYCHOPHYSICAL DETERMINATION OF 
EQUITABLE PAYMENT: 


A METHODOLOGICAL STUDY * 


SHELDON ZEDECK axo PATRICIA CAIN SMITH 
Bowling Green State University 


The practicality of using psychophysical methods to determine ranges for 
payment plans was explored. Thresholds of perceived equitable payment 
(PSE) and the “just meaningful difference” (jmd) of payment were determined 
by an adaptation of the psychophysical Method of Limits. PSE and jmd were 
significantly greater for junior executives than for secretaries of an academic 
institution, whereas the respective Weber ratios, K (the proportionate meaning- 
ful additions to the base salary), were not significantly different. ‘There were 

t differences between 2 secretarial subgroups in jmd and K, but 


Several theorists have independently pro- 


under- and overcompensation and then evalu- 


y no signifñcani 
a significant difference in PSE. The limitations and implications of this method, 
and its relevance to equity theory, were discussed. 


posed models for the determination of equity 
of payment for work (Adams, 1963, 1965; 
Homans, 1961; Jaques, 1961; Patchen, 1961; 
Sayles, 1958; Zaleznik, Christenson, & 
Roethlisberger, 1958). Each model assumes 
that all employees perceive a fair, just, or 
equitable return for what they contribute to 
their jobs. Furthermore, each model involves 
a social comparison (an employee determines 
what his equitable return should be after com- 
paring his inputs and outcomes, or his invest- 
ments and rewards, with those of co-workers 
or reference groups). Also, each model as- 
sumes, implicitly, that the dissonant cogni- 
tion by the employee of over- or undercom- 
pensation results in feelings of tension, dis- 
satisfaction, anger, or guilt. A 
Methods of assessing and evaluating 
equitable payment differ. Typical methods in- 
dude interviews and discussions with em- 
ployers and employees, observations by in- 
vestigators, and the collection of biographical 
data, Adams’ (1963, 1965) extensive research 
characteristically induced the feelings of 


I This research is based in part on a master’s 
thesis at Bowling Green State University. 


ated Ss’ production records. 

A premise that has not been directly 
investigated is that an employee can evalu- 
ate his income as equitable or inequitable in 
terms of some composite internal absolute 
standard which is not tied to any one group ot 
person. A further related problem is the 
sensitivity of the worker to salary inequities. 
Both Jaques (1961) and Adams (1965) sug- 
gest that workers are more sensitive to under- 

tion than to tion. An- 
drews (1967) attempted to reflect this idea in 
the hypothesis that a small underpayment 
will affect job performance as much as or more 
than a large overpayment. The hypothesis was 
partially confirmed, yet there still is no direct, 
reliable evidence for the differential sensitivity 
hypothesis. 

The present study investigated the intuitive 
internal perceptions of salary. Also, the study 
investigates directly the sensitivity of workers 
to inequities of pay. To determine the upper 
and lower thresholds of perceived equitable 
payment and the “just meaningful difference” 
(jmd) of payment (analogous to the jnd in 
psychophysical measurement), the study used 
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an adaptation of the Method of Limits which 
is the one method that most clearly defines the 
Concept of thresholds (Woodworth & Schlos- 
berg, 1954, p, 199). This method also makes 
it possible to determine amounts of propor- 
tionate meaningful increases (K) to base 
salary. 


MetxHop 
Subjects 


Questionnaires were mailed to 25 male junior ex- 
ecutives (Group I), and 96 female secretaries—45 in 
the maintenance department (Group II) and 51 in 
the executive department (Group III)—of a large 
midwestern academic institution. Each had at least 
1 yr. of experience in his Position, 

Returns of the questionnaire were 80% (n= 20) 
for Group I, 51 % (n=23) for Group II, 45% 
(m= 23) for Group III, and 6 from the secretarial 
pool for whom the secretarial office could not be 
identified. 


Research Material 


The questionnaire contained general job-satisfaction 
questions, intended to disguise the intent of the 
study, and the basic questions necessary for the psy- 
chophysical analysis, The format of the basic ques- 
tion was: 


TABLE 1 
PSYCHOPHYSICAL DETERMINATIONS 
Psychophysical 
Group 5 — 2 df t 
PSE 
I $7920.00 15 
II III $3814.29 21 34 | 18.49% 
Ir $3574.29 7 
TIL $3919.09 11 16 2.875 
jmd 
I $ 912.00 15 
T&M} $235.71 21 34 6.19% 
I $ 295.71 7 
III 8 196.36 11 16 1.30 
K 
I 115 15 
Tl & III 062 21 34 58 
I 083 7 
III .050 11 16 20 
% <.05, 
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Considering the type of work required of you on 
your present job, is a year 
— 4 fair salary 
more than a fair salary 
less than a fair salary. 


The salary levels within the basic question (one 
level, randomly placed, per page) were based on 
the average salary and probable error (PE) for the 
respective groups. There were nine salary levels, four 
levels above and four levels below the average, 
Each successive change above and below the average 
was an approximation of the PE: $180 and $720 a 
year for secretaries and executives, respectively, 


Procedure and Design 


o criteria were used in evaluating the data 
the responses were recorded in 


serial order). First, if the responses within an em- 


= and at least two = responses, or an = and a 
+, such a series was not evaluated. Similarly, in 
descending inspection, the appearance of a + after 
two =s, or an = and a =, disqualified the question- 


Resutrts 


with 21 Secretaries forming the combined 
Secretarial group (including 3 unclassified 
secretaries), 


In comparison of the junior executives with 
the total secretarial pool and with each sample 
of secretaries, both the PSE and the jmd were 
significantly greater for the executives (Table 


1). In contrast, however, a comparison of the 


respective Weber ratios reveals that the 
Proportionate meaningful additions to the base 
salary were neither large nor significantly 
different. 

A comparison of the two secretarial groups 
showed that although the actual earned 
salaries for Groups II and III were within the 


Same range, the PSE was significantly greater 


ay 


Psycnopnysics AND EQUITABLE PAYMENTS 


TABLE 2 


PERCEIVED MEAN SALARY RANGES 


` 3 Equitable | „p 

Group 75 <Fair (PSE + jmd) | >Fair 
Junior executives) 15 | $7008.00 | $7008.00-8832,00 | $8852.00 
Secretaries 21 | $3578.58 | $3578.58-4050.00 | $4050.00 
Group II 7 | $3278.58 | $3278.58-3870,00 | $3870.00 
Group III 11 | $3722.73 | $3722.73-4115.45 | $4115.45 


for Group III, whereas the jmd was not sig- 
nificantly different. The difference between K 
was also small and not significantly different. 

Based on this information, equitable pay 
ranges for all groups were constructed (Table 
2). 

Since the sample size for each group was 
small, the combined sample of returned ques- 
tionnaires was analyzed (Table 3). The lower 
threshold, T(—), for each employee who had 
indicated an upper threshold, T(+), was com- 
pared with the T(—) of those employees who 
had not stated a T(+). For both executives 
and the total secretarial group, the T(—) was 
significantly smaller for the subgroups that 
had indicated a T(+). 


Discussion 


This study demonstrates the potential 
utility of this method, despite the small sample 
size. The results of this study are, of course, 
situation specific; the PSE and jmd that were 
determined apply only to the groups on which 
the data were collected. The results cannot be 
generalized to a similar work group in the 
same setting (as evidenced by the differences 
obtained with the two secretarial groups), 
much less to a similar group in a different 
setting. The method may, however, have 
general applicability. 

It is to be expected that, as an employee 
goes up the work-level scale, PSE and jmd 
will increase. However, the nature of the 
functional relationships involved is an interest- 
ing point for speculation based on the data 
obtained. Since there was no significant dif- 
ference between the K values for the groups, 
either Weber’s, Fechner’s (Guilford, 1954), 
or Stevens’ (1957) law might hold. Applying 
this technique to a greater number of groups 
along the work and pay-grade scale would 
reveal the nature of the wage curves in 
operation for any specific company. 


TABLE 3 


Lowex Turesson, T(—), ror Gaours wir ann 
wrrnout Urra Tiexsnotp, T(+) 


t With Without | i 
* r+) | r 
Junior executives | $7008.00 | $9150.00; 18 |367" 
(n = 15) | (» = 5) | | 
Secretaries | $3578.57 | $3893.48 | 22 | 3.01" 
(n = 21) | (n = 23) | 
"p -N N j J Te 


The technique also detects differences be- 
tween groups at the same job level. The sig- 
nificant difference between PSE for the two 
secretarial groups could be attributed to 
different working conditions for the two 
groups, to differences in personal needs, or 
other causes which can be subsequently 
investigated. 

Though the two secretarial groups had dif- 
ferent perceptions of fair salaries, the amount 
of meaningful increase was approximately the 
same. It is not suggested that the average 
K of 6.2% should be applied in determining 
raises, but that a range such as 5-8% be 
used. This range would be useful in determin- 
ing raises with consideration for education, 
experience, seniority, etc. The use of ranges 
should be applied for all determinations based 
on the method, instead of using the exact 
PSE, jmd, or K, and could correct for the 
individual differences within the groups. 

A point to be considered in using this tech- 
nique is that of sensitivity to inequities. 
Adams (1965), Jaques (1961), and Andrews 
(1967) suggest that Ss are more sensitive to 
underpayment as opposed to overpayment. 
The data are not inconsistent with these 
generalizations. Since the choices included 
four PEs above and four PEs below the 
average salary, one concludes that Ss were 
more willing to indicate inequities below than 
above equitable salaries. For the junior execu- 
tives, 100% of the sample indicated a Gas) 
whereas only 75% indicated a T(+). The 
results were more striking with the secretaries, 
98% of whom indicated a T(—) while only 
49% indicated a T(+). However, these find- 
ings cannot be taken as conclusive for it is 
possibly an artifact of the number and values 
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of the levels used, and might also indicate 
dissatisfaction. 

Since more than 50% of the returned ques- 
tionnaires did not meet the criteria for evalua- 
tion, an attempt was made to extract some 
information from the unevaluated question- 
naires by comparing the T(—) responses of 
the unevaluated with the evaluated question- 
naires. The significant difference between the 
T(—) responses for both the executives and 
secretaries could be interpreted as indicating 
that the differences were only in intervals of 
uncertainty (equitable range) and not a dif- 
ference in jmd. 

Limitations 

Several technical limitations must be con- 
sidered. The results were obtained by a mailed 
questionnaire. This method of data collection 
is hazardous for this type of study. It is im- 
portant, for validity purposes, that S fills out 
one page at a time and does not look back to 
his previous answers. The S should not have 
the opportunity to scan the questionnaire, 
familiarize himself with the levels, and then, 
with the aim of being consistent, fill in his 
responses. A suggested procedure is to ad- 
minister the questionnaire to a group of em- 
ployees simultaneously, requesting each S to 
answer one page at a time, hand in or place 
a completed page in an envelope, and then 
receive another page, the process being re- 
peated until the task is completed. Also, with 
this type of administration, the sample size 
would be increased. 

Another deficiency in the technique is the 
number and size of increments. The value of 
the increments should be sufficiently small so 
as to be most sensitive to the inequities. The 
desired sensitivity can be gained by following 
a test-retest procedure. After administering 
the questionnaire with nine levels of salary, 
one could retest the same group a short time 
after with the same filler questions, but with 
nine different, though appropriate, salary 
levels, resulting in 18 levels from which to 
determine PSE, jmd, and K. 


Implications 


Once the technique is refined, general in- 
formation as to who perceives himself fairly 
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paid, less than fairly paid, and more than 
fairly paid will be provided. On the basis of 
these determinations, several of the hypotheses 
that Adams and his associates have tested 
could be examined without inducing feelings 
of inequity. The quantity and quality of work 
for each employee could be compared and 
differences, if any, between those perceiving 
themselves equitably and inequitably paid 
evaluated. The influence of the input variables 
on perceptions could be examined; for ex- 
ample, what is the jmd for someone with 20 
yr. of experience as opposed to someone with 
5 yr. of experience? The effects of task dif- 
ferences on perceived equity for groups with 
similar salary scales could be investigated. 

Another analysis could be made of the 
relationship of a person’s perceptions to his 
satisfaction. It is conceivable that anyone 
who would deviate from the equitable interval 
would also be dissatisfied with his job and its 
conditions. 

Still another interesting analysis would be 
the determination of jmd for males as opposed 
to females in the same work group, or career 
women as opposed to women supplementing 
their families’ incomes. 

Finally, it would be possible to use the re- 
sults when considering the granting of raises 
and increases due to promotions, or for dis- 
tinguishing different levels of jobs. It appears 
that this would be an improvement over the 
arbitrary techniques that are presently being 
used in industry, As Jaques (1961) pointed 
out, equitable payment should prove econom- 
ically sound as well as socially just, because 
it would diminish wrangling and encourage 
efficiency. Furthermore, Opsahl and Dunnette 
(1966) found in their review of the role of 
financial compensation in industrial motiva- 
tion that companies “assign” each employee a 
“pay curve” which is the result of successive 
alterations in compensation and compensa- 
tion policies through the years. They noted 
that the usual way of doing this is with little 
or no advanced planning. Increments are 
simply given haphazardly on a year-to-year 
basis and the resulting career curve is equally 
haphazard. If this is the situation, if incre- 
ments are given without planning and regard 
for its meaning to the employee, then a better 
technique is needed, 
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SEARCHING FOR LETTERS OR CLOSED SHAPES IN 
SIMULATED ELECTRONIC DISPLAYS 


E. C. POULTON 
Applied Psychology Research Unit, Cambridge, England 


16 men counted the number of bright symbols of a particular kind in a simu- 
lated electronic display presenting a total of between 19 and 59 symbols of 8 
different kinds, In some displays all the symbols were bright, in others bright 


number of bright symbols present (p<.01). Dim symbols tended to be 


counted as bright (p < 01). 


Electronic display systems are capable of 
showing the locations of large numbers of ob- 
jects at a time on a single cathode-ray tube, 
Where the objects can be classified into 
classes, the objects belonging to a class can all 
be represented by the same symbol. For ex- 
ample, a different capital letter can be used for 
each class. Since part of the man’s difficulty 
in spotting items of a particular class consists 
in discriminating between them and the re- 
maining items, the letters used should be 
easily discriminable from each other (Howell 
& Kraft, 1959). Men working with displays 
may be accustomed to using 
simple shapes such as squares, triangles, and 
diamonds; in which case they are not likely 
to relish a change to capital letters unless 
letters can be shown to be more easily dis- 
criminable. The Principal aim of the experi- 
ment was to compare the discriminability of 

of capital letters with a set of closed 


discriminability. No experimental work com- 


very kindly made the slides. Statistical advice from 
P. M. E. Altham is also gratefully acknowledged. 


paring the relative discriminability of letters 
and shapes appears to have been published 
previously. 

Tn an experiment of this nature additional 
independent variables are the number of 
symbols of the class for which the man is 
searching, and the total number of symbols 
displayed (Sleight, 1952). If the symbols are 
Presented bright against a background of ad- 
ditional dim symbols of similar shapes, the 
number of dim symbols can be another in- 
dependent variable. A subsidiary aim of the 
experiment was to determine the effect of these 
three distinct sources of display clutter upon 
the man’s ability to Spot target symbols. 


METHOD 
Materials 


The four sets of symbols are illustrated in Figure 
1. The symbols were about 2.7 mm. tall, which is the 
height of 12-point newspaper capitals. The shapes in 
Column 4 were chosen to be representative of the 
kinds of shapes to which controllers might have be- 
come accustomed during the course of their duties. 
The capital letters in Column 1 were selected to give 
a set of shapes with features similar to those of the 


mon to Columns 3 and 4, But both columns con- 
tain three 
(see Table 2). Column 2 represents a degree of letter 
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ht. The pictures were finally reduced 


that t tives could be mounted in 


For each of the 4 g 
one of the row F 
targets. In tl 8 


each of the mbol in the set 
selected once. II mber of bright target 
on the slides t In the 24 test ar 
rangements with both bright and dim symbols, each 
of the 8 kinds n th 
the target I 16 test arrangements v 
only bright syr of 8 kinds of symbol was 
random with the 
0 syr as not a target in 
two tests d from the same basic pattern. The 
number of bright target bols on the slides range 
from 0 to 11. Where both bright and dim symbols 
were present, the total number of bright symbols 
ranged from 10 to 26, the total number of dim from 
11 to 47. Where only bright symbols were prese 
their numbers ranged from 19 to 55. 


0 6 


set was sele 


selected twic 
restriction th 


Experimental Design and Subjects 


For each of the four experimental conditions cor- 
responding to the four sets of symbols in Figure 1, 
the 40 experimental slides were arranged in four 
lots of 10, each lot containing 1 of each of the 10 
basic patterns. The order of the four lots, and the 
order of the arrangements within lots, was the same 
for each experimental condition 

Sixteen men acted as experimental Ss. Their ages 
ranged from the mid 20s to the late 50s. One group 
of four men was experienced with electronic displays 
of this nature, another group of four men was very 
experienced, while the other two groups were, re- 
spectively, not very experienced and inexperienced. 
Each man in a group of four received the four ex- 
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Procedure 


rately. He sat front 
r screen of 12 in 
h the slides were 
ately 


Each man was tested 


of a vertical semitranst 


diameter, upon the back of 
projected. The center of the scr 
level with his eyes, and about 18 in. away. He was 
desired to focus 


as approxir 


told that he could move the screen 
the image of a slide, and to center it 

Before each slide was presented the man was told 
which was to be the target symbol, and ether or 
not the slide would contain dim symbols as well as 
the bright ones which alone had to be looked at 
The slide v not presented until he indicated that 
he was ready to begin. As soon as the slide ap 
peared, he had to count all the bright target symbols 
and call out his answer, He was told to make sure 
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that his answer was correct before he gave it, but 
not to waste time unnecessarily. A stop watch was 
started as each slide appeared, and stopped as soon 
as he gave his answer. While the answer and time 
taken were being recorded by the E’s assistant, the 
man was told which was to be the next target symbol. 

After each of the eight practice slides the man was 
told whether his answer was right or not. If he was 
wrong, E went over the slide with him until he dis- 
covered his mistake. No knowledge of results was 
given after any of the 40 test slides. At the end of 
the session the man was thanked and generally en- 
couraged. He was asked which symbols he confused 
with which and his preferences for individual symbols 
and sets of symbols, Including instruction and prac- 
tice, the first session took an average of just under 
30 min., subsequent sessions just under 20 min. 
Normally each man performed one condition per 
day, or one condition in the morning and a second 
in the afternoon. But two men worked alternately 
one afternoon, since this was the only time at which 
they were available. 


Scoring and Calculations 


Errors were classified either as underestimations of 
the correct number of target symbols or as over- 
estimations. Since over half the errors were only one 
different from the correct answer, the sizes of the 
errors have been neglected in most of the quantita- 
tive analyses. Two-tailed statistical tests have always 
been used except where stated. 


RESULTS 
Differences between Sets of Symbols 


Incidence of underestimations. Table 1 gives 
the incidence of errors separately for the four 
sets of symbols, both when only bright sym- 
bols were present in the display, and when dim 
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symbols were present as well as bright. An- 
swers underestimating the correct number of 
target symbols are shown separately from 
overestimations. The data in the bottom row 
of the table, pooled over all four sets of 
symbols, show that the incidence of under- 
estimations was virtually unaffected by the 
presence of dim symbols in the display. In 
comparing the four sets of symbols on under- 
estimations, the results from all 40 slides have 
therefore been pooled, as shown in the right- 
hand column of the table. 

Analysis of variance on the pooled under- 
estimations indicated that the effects of sets 
of symbols and of practice were both highly 
reliable ( < .001). Differences between indi- 
viduals were also highly reliable, but the 
small differences between the groups of men 
with different degrees of experience of elec- 
tronic displays were not reliable. On Tukey’s 
method for multiple comparisons Ryan, 
1959), both the letters and the semiclosed 
letters gave reliably fewer underestimations 
than either the closed letters or the shapes 
(p < .05 or better). 

As a result of the Latin-square design, the 
shapes could have been at a disadvantage if 
there were greater positive transfer between 
the three sets of symbols resembling letters, 
than from these three sets to the shapes. To 
check that the underestimations of the shapes 
in the last column of Table 1 were not due 
entirely to this cause, the performances were 


TABLE 1 


MEAN PERCENTAGE OF INCIDENCE OF UNDERESTIMATIONS AND OVERESTIMATIONS 
ACCORDING TO SYMBOLS DISPLAYED 


16 slides with only 24 slides with bright and 
bright symbols dim symbols All 40 slides 
Set of symbols Under- 
Under- Over- Under- Over- ee 
estimations | estimations estimations | estimations 
ee a he a 
Letters 16.4 4a 12.0 p. ig 
Semiclosed letters 11.3 3.58 16.4 E 1 5 
Closed letters 19.2 12 23.2 3.1 21.6 
Shapes 28.9 8 21.4 3.6 24.4 
All 4 sets of symbols combined 19.0 TS 18.2 5.1 18.5 


* Letters-semiclosed letters p < .02. 
» Letters reliably different from all 
„ Letters-closed letters or shapes p<.01 

4 Semiclosed letters-closed letters or shapes h <.05 or better. 


other sets of symbols at the. 01 level or better. 


a 
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examined of the six men whose first two tests 
involved shapes and either letters or semi- 
closed letters. Two men first searched for 
shapes and then for semiclosed letters, while 
two other men had these two conditions in the 
reverse order. There were also a pair of men 
whose orders of searching for shapes and 
letters balanced each other. Of these six men, 
five gave more underestimations on the shapes. 
The sixth gave the same number of under- 
estimations on the semiclosed letters as on 
the shapes, but had the shapes as the second 
condition. Correcting for order, the difference 
between the shapes and the letters or semi- 
closed letters is reliable on a two-tailed sign 
test (p < .05). Thus the greater number of 
underestimations of the shapes in Table 1 is 
unlikely to have been due entirely to cumula- 
tive transfer effects. 

Incidence of overestimations and reported 
confusions. The bottom row of Table 1 shows 
that overestimations were over three times as 
common on the slides carrying both bright 
and dim symbols, as on the slides carrying 
only bright symbols. A Wilcoxon test (Siegel, 
1956, pp. 75-83) indicated that the difference 
was reliable (p < .01). The two sets of slides 
thus need to be treated separately on this 
measure. 

The slides with only bright symbols pro- 
duced a total of only 15 overestimations. A 
sign test indicated that there were reliably 
fewer overestimations with the letters than 
with the semiclosed letters (>< .02). None 
of the other differences between sets of sym- 
bols was reliable here. All but 1 of the 15 
overestimations were just one greater than 
the correct number. They could have been 
due to counting a single target symbol more 
than once, or to counting one wrong kind of 
symbol as a target. The single remaining 
overestimation was eight too large; it was 
probably due to counting W’s instead of U’s. 

On the slides containing both bright and 
dim symbols, half the overestimations were 
two or more greater than the correct answer. 
Wilcoxon tests indicated that there were re- 
liably (p < .01 or better) more overestima- 
tions on the series of slides with letter sym- 
bols than on any of the other three series, 
which did not differ reliably from each other 
(p > .05). Since the letters gave if anything 


TABLE 2 
Commonty Reroxrep CONTUSIONS BETWEEN 
Sympois 
Set of symbols Pairs confused 
Letters W and M 
Semiclosed letters Vand * 
Closed W and M 
Closed letters Closed V and Y 
Closed V and A 


Triangle and diamond 
Diamond and split diamond 
D on side and split D 


Shapes 


the fewest overestimations when only bright 
symbols were displayed, most of these over- 
estimations with the combined bright and dim 
symbols must have been due to counting dim 
symbols as bright. 

According to the reports of the experimental 
Ss, the most common confusions between sym- 
bols were those indicated in Table 2. 

Time taken. The mean time taken to search 
for all the bright target symbols and to call 
out the number counted was 11.9 sec. on the 
slides with only bright symbols, compared 
with 11.4 sec. on the slides with both bright 
and dim symbols. This difference was reliable 
on a Wilcoxon test against the differences be- 
tween experimental Ss (p< 02), but the 
difference was not reliable when tested against 
the differences between slides. (As will be 
pointed out later, the mean response time de- 
pended upon the number of bright symbols 
displayed. The group of slides with only 
bright symbols happened to include 4 slides 
with very large numbers of bright symbols, 
as well as 12 slides with average numbers— 
see Figure 3.) Separate analyses of variance 
on the two sets of slides indicated that the 
only other reliable effect was produced by 
the differences between individuals (p< .001). 
Mean time taken was not affected reliably 
by the set of symbols used, by practice dur- 
ing the experiment, nor by previous experi- 
ence with electronic displays. 


Sources of Clutter in the Display 


Incidence of underestimations. Figure 2 
shows how the incidence of underestimations 
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Fic. 2. Mean percentage of incidence of answers underestimating the number of target 
symbols, shown according to the number of target symbols actually present. (In A for 
slides with only bright symbols, the numbers printed beside the functions indicate the 
total number of symbols on the slide. In B for slides with both bright and dim symbols, 
the first number indicates the number of bright symbols on the slide, the number in 
brackets indicates the total number of combined bright and dim symbols present. Most 
points represent data from the same 16 men pooled over 4 slides, one with each set of 


MEAN INCIDENCE OF UNDERESTIMATIONS (%) 


symbols. But in functions with only 3 points, one of the points represents data pooled 


over 8 slides.) 


of the correct number of target symbols was 
influenced both by the number of target 
symbols to be counted, and by the total num- 
ber of bright symbols displayed. Each func- 
tion represents one basic pattern of symbols 
(see Method section). The number printed 
against the function gives the total number of 
bright symbols in the basic pattern. In 
Figure 2B, for the slides in which dim 
symbols were displayed as well as bright, 
the number in brackets gives the total num- 
ber of symbols in the basic pattern, both 
bright and dim. The four points on the func- 
tion are for the four different orientations of 
the basic pattern. Usually the number of 
signals of the kind designated as target was 
different for each orientation, but five of the 
functions have only three Points because the 
number of target signals was the same for two 
orientations. The data for the four sets of 
symbols have been pooled. The orientation of 


the basic pattern of symbols, the particular 
symbol designated as the target, and the 
serial position of the slide in the set of 40 
test slides have been treated as random vari- 
ables in this analysis. They may account for 
some of the residual variability displayed by 
the data. 

Each function in Figure 2 was fitted by a 
straight line using the method of least squares. 
In this first analysis the three points at the 
origin in Figure 2B were excluded, since it is 
not possible to underestimate the number of 
targets when none is displayed, yet any func- 
tion with a point at the origin is almost 
bound to show a positive slope. The com- 
puted slopes of all 10 functions were found 
to be positive (p < .01). This means that as 
more target symbols were added to the dis- 
play, the probability of failing to report at 
least one target symbol increased. 

Each of the 10 functions was therefore 
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fitted by a straight line passing through the 
origin, again using the method of least squares. 
The slopes found on this second analysis are 
given in Table 3. The fitted slopes of the 
functions in Figure 2A ranged from 2.1% 
underestimations per target symbol for a total 
of 19 symbols to 7.6% per target symbol for 
a total of 55 symbols. The tau coefficient of 
rank correlation (Siegel, 1956, pp. 213-223) 
between the fitted slopes and the total num- 
ber of symbols displayed was +.67. Thus the 
probability of missing at least one target sym- 
bol tended to increase more rapidly with the 
number of target symbols as the total number 
of symbols in the display was increased. 

The fitted slopes of the functions in Figure 
2B were rather steeper, ranging from 2.9 for 
a total of 10 bright symbols to 9.0 for a total 
of 26. The four functions for totals of be- 
tween 18 and 21 bright symbols all had 
steeper fitted slopes than any of the three 
corresponding functions in Figure 2A for 
totals of between 19 and 23 symbols. A Mann- 
Whitney U test (Siegel, 1956, pp. 116-127) 
indicated that the average difference was just 
reliable (p = .028 on a one-tailed test). Thus 
the probability of missing a target symbol 
increased when dim symbols were added to a 
display containing only bright symbols. 

For the functions in Figure 2B, tau be- 
tween the fitted slopes and the total numbers 
of bright symbols in the display was +.60. 
For the total numbers of both bright and dim 
symbols in the display, and for the total num- 
bers of dim symbols only, taus were only 
+.07 and —.07, respectively. Thus the prob- 
ability of missing a target symbol depended 
a good deal more upon the total number of 
bright symbols in the display, than upon the 
total number of bright and dim symbols com- 
bined, or of dim symbols only. 

Even the larger values of tau were not re- 
liable statistically (p > -05), as there were 
not enough functions in either of the figures. 
However where two functions have one or 
more points in the same yertical columns of 
the figure, it is possible to compare their rela- 
tive heights above the abscissa at these points 
with the number of target symbols held con- 
stant. In Figure 2A the functions for 55, 23, 
and 19 total symbols all have points in the 
columns for both 3 and 2 target symbols. For 
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TABLE 3 


Coururxb SLores or Foxcnions iw Fa, 2 


Total number of symbols | 


on slide | (Figure 2A) | (Figure 2B) 
| Bright sym- | Bright and 
i | bols onl i 
Bright Bright only |dim symbols 
+ dim) 
10 (57) | = 28% 
i 
18 (45) | . 
18 (48) 54> 
19 2.17 
19 (30) s37 
20 5.32 
21 (36) | 749 
23 4.07 
26 (59) 8.99 
55 7.61 
Note. — Mean ami pe incidence of 3 


per target symbol. 

= Of the functions for numbers of bright symbols rang! 
from 18 through 23, those for dim symbols as well (Figure 255 
all had steeper slopes (% < 0s on a one-tailed test). 


these two numbers of target symbols com- 
bined, the function for the total of 55 dis- 
played symbols lies reliably above the other 
two functions, which give almost identical re- 
sults (p < .02 or better on Wilcoxon tests). 
The function for 55 displayed symbols pooled 
over 4 and 2 target symbols also lies reliably 
above the function for 20 displayed symbols 
(p < 02). 

In Figure 2B toward the bottom, the func- 
tions for 10 (57) and 18 (45) total symbols 
both have points in the columns for both 3 
and 4 target symbols. For these two numbers 
of target symbols combined, the function for 
18 (45) displayed symbols lies reliably above 
the function for 10 (57) displayed symbols 
(p < 01). Toward the top of the figure this 
same function for 18 (45) displayed symbols 
pooled over 1 and 4 target symbols lies re- 
liably below the function for 26 (59) dis- 
played symbols (5 < 02). So does the func- 
tion for 18 (48) displayed symbols pooled 
over 4 and 5 target symbols ( < 01). Thus 
the total number of bright symbols in the 
display clearly affects the heights of the points 
in Figure 2. 

Sizes of underestimations. When two or 
more target symbols were displayed, under- 
estimations of the correct number of targets 
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Fic. 3. Mean time taken to count target symbols according to the number of target 
symbols present. (The parameters are the same as in Figure 2.) 


of size 2 and greater averaged 35% of the 
total number of underestimations. This pro- 
portion is a good deal larger than that pre- 
dicted by a Poisson distribution of under- 
estimations. It strongly suggests that on trials 
where the man had missed one target symbol, 
he was likely to miss one or more additional 
target symbols, 

As was to have been expected, the propor- 
tion of underestimations of size 2 and greater 
increased as the number of target symbols 
increased, from 25% when there were 2 
through 4 target symbols to 55 % when there 
were 7 or more target symbols. For the data 
pooled over Ss, the tau correlation between 
the proportion of underestimations of size 2 
and greater and the number (2 or greater) 
of target symbols displayed was +.56 (p 
< .05). 

Time taken. Figure 3 is similar to Figure 2, 
but the dependent variable on the ordinate is 
the mean time taken to count all the bright 
target symbols present, and to call out the 
number. In Figure 3A only the top function 
for a total of 55 bright symbols shows an 
appreciable slope. Here the difference between 


11 and 4 target symbols was highly reliable 
( < .001), and the difference between 4 and 
both 3 and 2 target symbols was also reliable 
($ < .01). In Figure 3B three of the six func- 
tions slope up to the right, while the other 
three slope down to the right. Clearly, except 
for the slides with 55 bright symbols, the 
number of target symbols did not appreciably 
affect the mean time taken to respond. 
However mean response time tended to in- 
crease with the number of bright symbols dis- 
played. In Figure 3A the top function for 55 
displayed symbols pooled over 3 and 2 target 
symbols lies reliably above the functions for 
both 23 and 19 displayed symbols (p < .001). 
Pooled over 4 and 2 target symbols it also lies 
reliably above the function for 20 displayed 
symbols (p < .001). At the bottom of the 
figure the function for 19 displayed symbols 
pooled over 3 and 2 target symbols lies re- 
liably below the function for 23 displayed 
symbols (p < .01). Pooled over 2 and 1 target 
symbols it also lies reliably below the func- 
tion for 20 displayed symbols (p < .001). The 
tau correlation between the mean heights of 
the four functions and the total numbers of 
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symbols on the slides was +1.00 (p < .05 on 
a one-tailed test). 

In Figure 3B the differences in height cor- 
relate with the total number of bright sym- 
bols on the slide only when there are few 
bright symbols. The function at the bottom 
for 10 (57) displayed symbols pooled over 4, 
3, and 0 target symbols lies reliably below 
the function for 18 (45) displayed symbols 
(p < 01). For 3 target symbols it also lies 
reliably below the function for 19 (30) dis- 
played symbols (% < .05). But the relation- 
ship between the total number of bright sym- 
bols on the slide and the mean time taken 
does not hold for the functions higher up in 
the figure. Whereas the function for 18 (45) 
displayed symbols is one of the lowest, the 
function for 18 (48) displayed symbols is on 
average the highest. The tau correlation be- 
tween the mean heights of the six functions 
and the total numbers of bright symbols on 
the slides was +.53. For the total numbers of 
both bright and dim symbols in the display, 
and for the total numbers of dim symbols 
only, tau was only +.07 and —.07, respec- 
tively. 

The mean response times of the functions in 
Figure 3A for totals of 19, 20, and 23 bright 
symbols can be compared with the means of 
the functions in Figure 3B for totals of 18 
(45), 18 (48), 19 (30), and 21 (36) symbols. 
Both lots of functions are for conditions with 
about the same total numbers of bright sym- 
bols, but the conditions of Figure 3B involved 
dim symbols also. A Mann-Whitney U test 
showed no reliable difference between the two 
lots of functions (p > .05). Thus the presence 
of dim symbols in the display did not ap- 
preciably affect the mean time taken to re- 
spond. 


Discussion 


Letters versus Shapes 


The right-hand column of Table 1 shows 
that the number of target symbols displayed 
was less likely to be underestimated when a 
set of capital letters was used than when a 
set of closed letters or shapes was used. A 
possible reason for the greater efficiency of 
the set of letters is that the forms of capital 
letters have been enormously overlearned. 


Ever since a child first learns the alphabet, he 
has been perceiving and identifying capital 
letters, whereas some of the forms of the 
closed letters and of the shapes are relatively 
unfamiliar (see Figure 1). Thus in searching 
for a particular letter among a set of letters, 
the man knew precisely what he was looking 
for; he may have been somewhat less sure 
when searching for a particular closed letter 
or shape. 

Another possible reason is that the letter 
forms may be more easily discriminable from 
each other than are the forms of the closed 
letters or the shapes. V, W, U, and Y all have 
an open end at the top; A and M have an 
open end at the bottom; N has an open end at 
both the top and the bottom; D is a fully 
closed letter; whereas the closed letters and 
shapes are all closed with the single exception 
of the U. It is clear from Howell and Kraft’s 
(1959) confusion matrices that a less con- 
fusible set of capital letters could have been 
chosen, and the same probably applies to 
sets of closed shapes (Bowen, Andreassi, 
Truax, & Orlansky, 1960). But in so far as 
some of the letters are closed and some are 
open with the openings pointing in different 
directions, the direction of the opening sup- 
plies a cue which is not available with sets 
of closed forms. This point does not always 
appear to have been appreciated in consid- 
ering sets of symbols for electronic displays 
(Bowen et al., 1960; Sleight, 1952; Vicino, 
Andrews, & Ringel, 1966). 


Cluttered Displays 


Figures 2 and 3 illustrate the effect of three 
types of clutter: total number of symbols dis- 
played, number of bright symbols when only 
bright symbols require attention, and number 
of target symbols requiring immediate atten- 
tion. Figure 2 shows that the probability of 
missing at least one target symbol increased 
more or less linearly with the number of target 
symbols displayed, as if each target symbol 
had an equal chance of being the first one to 
be missed, which was independent of the 
number of other target symbols present. The 
data for multiple underestimations suggest 
that once one target symbol had been missed, 
there was an increased chance of missing at 
least one more target symbol. 


target symbols, 
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of bright symbol which 
not relevant to his present purpose. 
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The chief effect of adding dim symbols to 
the display was to increase the probability 
of overestimating the number of bright target 
symbols (Table 1), some dim symbols being 
counted as targets. Table 3 shows that adding 
dim symbols also slightly but reliably in- 
creased the chances of missing at least one 
bright target symbol. Thus if the man does 
not need to refer to certain symbols, he should 
be able to remove them completely from the 
display, rather than being able only to reduce 
their brightness. 
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EFFECT OF CONTROL PLACEMENT ON INFORMATION 
TRANSFER RATE USING BISENSORY SIGNALS 


AD PERRIMENT 
Moneth University, Clayton, Victoris, Aurela 


A 2X 2 K 4 factorial experiment was designed to examine the effect 
various display-control relationships bad upon the rate at which Ss could 
vit information from a blsensory display. Ss were required to red to s 
Visual signal simultaneously presented with an auditory sigsal of equal we: 
certainty. Responses were made by depressing push butions. 
code variables examined were (1) the code carried by each 
limbs, (2) the code carried by the operating digits of sch band, and (3) 


Pec’ flexibility of the human component in man-eachine systems it alo 


illustrated. 
A previously reported study (Perriment & separated from the other by 6, 12, or 24 is. The Sù 
Webster, 1967) drew attention to some of the eed record the — omg pas 


factors influencing optimal display-control ar- ficht lamp flashed, and whether the 
rangements, where a bisensory signal required at the left or right earphone. As each 

i 3 t components required a binary 
a bimanual motor response. The present paper 3 
further two of the previous findings, which , 
were held to be of particular interest. One of 4 red 
these was the apparent superiority, in terms signal 
of information transfer rate (ITR), of con- sent 
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allowed 
other was the evidence obtained for complex ponent of the —— —— 
interaction effects between the three response- e tad 1 of the total response) in 
code variables studied. The work reported milliseconds. 
here was directed at examining the generality Subjects 
of the control separation effect, and investi- nla ea 
gating in greater detail the interaction effects Sixty-four volunteer Ss were drawn 


i troductory class, No inducement was 

obtained. offered for participation. The Ss were allocated to 

ofero rious treatment groups according to their at- 

Merson tendance at the laboratory. None had previous e 

A . task required. As the earlier, S07 
* had failed produce evidence supporting 


The apparatus has been described 

where (Perriment & Webster, 1967). For the present potha ea D a 1 
study, the response panel was modiſied to a allocat 

graduated lateral displacement of the response rea 

tons, Briefly, the signal display consisted of a 1 “4 Procedure 

cps tone and a light flash, simultaneously bere aha S ot at 2 D0 — 
through earphones and a pair of lamps located di- oe adie jin rizontal panel middle 
rectly in front. The buttons were arranged 3 * * 
panel in one of ton A the form 5 — 3 babs of buttons dane uae emrei “te ight 
directly in front o arrangemen erred a megs 3 8 

- to oe ects separation), or with each pair laterally band operated pair panel. 
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Fic. 1. Layout of signal display and response panel. 


and the leit hand those on the left. In conditions of 
zero separation, that is, the buttons being arranged 
in a rectangle directly in front, the right hand op- 
erated the pair farthest from S, and the left hand 
the pair closest to him. Figure 1 illustrates the layout 
of the response panel. 

The operating digits of the two hands carried 
either corresponding (DC) or noncorresponding 
(DN) codes. (See Figure 2.) For example, where 
the limb was coded to the signal’s direction of ori- 
gin, both index fingers responded to the visual 
component of the signal, one to direction tight and 
the other to direction left, as in Code Condition 3. 
Alternatively, one digit responded to the visual and 
the other to the auditory components, as in Condi- 
tion 4. Similar conditions of digit correspondence 
applied when the code carried by the limbs was re- 
lated to the sensory mode of the signal components. 
In this case, correspondence was defined as both 
digits carrying the same directional code, as in Con- 
dition 1, or different codes, as in Condition 2. The 
experimental design used is illustrated in Table 1 and 
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TABLE 1 


EXPERIMENTAL DESIGN 


Digit 
corresponding 


Digit 
noncorresponding 


Code 


Separation (in inches) 


Limb-mode 44444 
Limb- direction 4 4 4 44 


Note. Cell entries are number of Ss used. 


the display- control relationships generated by this 
design in Figure 2. Instructions appropriate to the 
code condition were read to Ss from a prepared text. 
This was followed by a training session consisting of 
two blocks of 100 signals each, separated by an in- 
terval of 3 min. A test series of 60 signals was then 
Presented. The previous study had shown that per- 
formance in terms of the information transfer rate 
(ITR) had continued to improve over 10 test trials 
of 20 signals each for the 80 Ss used, The training 
session employed in this study aimed at obtaining 
performance measures which were relatively unaf- 
fected by gross learning effects. 


RESULTS 


The dependent variable used for analysis in 
this study was the ITR achieved by S. 

The multivariate analysis technique sug- 
gested by McGill and summarized by Att- 
neave (1959) was used to calculate the in- 
formation measures used. ITRs were obtained 
by dividing the absolute amount of informa- 
tion transmitted by S ( T[x;y]) by the mean 
reaction time. The obtained ITRs were sub- 


Code Separation Oins. Separation 6-24ins. Code Group 
Condition 7 a 

® © visua Audio Visual 

1 O GO S O Limb- Mode 
© ® audio Digit- Corresponding (LM-DC) 
© ® Visual Visual Audio P 

2 Limb -Mode 
© ® Audio © © O O Digit - Non-corr.(LM-DN) 

Right Left Right 

3 ONE (ORO) O O Limb -Direction 

® © Left Digit - Corresponding(LD-DC) 


= Ee i sane 


® © Right Left 
j ® © Left ORO) 


Limb-Direction 
Digit- Non- corr. (LD- DN) 


717 Le a 


Fic. 2. Control-display arrangements for Limb: Digit: Separation conditions. 
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TABLE 2 


ANALYSIS OF VARIANCE SUMMARY OF THE OBTAINED 
Inrorsarion Transrer Rates (Ars / sxc) 


Source 


F ratios 
Control separation (A) 3 1.683 
Digit code (B) 1 2.726 
Limb code (C) 1 16.679** 
AXB 3 3.039* 
“AXC 3 6.626"* 
BXC 1 2.097 
AXBXC 3 402 
Error 48 
Total 


Note.—The homogeneity of the treatment variances was 
tested using Hartley's Fma test (Winer, 1962). The obtained 
F ratio of 65 was well below the level required for significance 
at the .05 level. 

+p 8.05. 

"p <.01. 


jected to an analysis of variance, the results 
of which are summarized in Table 2. 

Examination of Table 2 shows that the only 
variable making an independent contribution 
to the total variance is that of the limb code 
used. The mean ITR obtained where the op- 
erating limb responded to the mode of the 
signal components was 1.17 bits/sec. When 
the limb responded to the directional compo- 
nents, ITR increased to 1.40 bits/sec. That 
this effect was not consistent for all treatment 
groups is shown by the significance of the A X 
C interaction (Figure 3). The significant fea- 
ture of this effect is that it is confined exclu- 
sively to the displacement of the response 
buttons from a position on the midplane of S 
to the first position of lateral separation. 

To illustrate the effect of control separation 
under each of the four response codes, treat- 
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Fic. 3. Interaction of limb code and control separa- 
tion, (LD = limb direction; LM = limb mode.) 
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Fio. 4. Interaction of digit code and control sepa- 
ration. (Comparison between limb codes; LD = limb 
direction; LM = limb mode; DC = digit correspond- 
ing; DN = digit noncorresponding.) 


ment means are plotted separately in Figure 


4. 

As the values plotted in Figure 4 represent 
the performance of the 16 treatment groups 
separately, independent tests were made to 
determine the significance of the differences 
displayed between limb-digit code groups for 
each of the lateral separation conditions, and 
between each of the separation conditions for 
each limb-digit code group. (See Tables 3 and 
4.) Table 3 summarizes the effect upon ITR 
of laterally separating the response buttons 
for each of the limb-digit code conditions. The 
impression conveyed by Figure 4 is supported 
by the results of these analyses. Only the 
limb-mode-digit-noncorresponding condition 
was significantly affected by control separa- 
tion, and this was restricted to the displace- 
ment from 0 to 6 in. separation. Table 4 pre- 
sents the results of the separate analyses made 
of the differences between the means obtained 
by each of the limb-digit groups for the lateral 
separation conditions. The differences between 
the limb codes found in the main analysis of 
variance (Table 2) are reflected in these sepa- 
rate analyses, with the exception that the 
groups operating under the condition where 
the controls were separated by 24 in. did not 
differ significantly in performance. 


Discussion 


One feature of the results obtained from 
the present study, which is incongruent with 
those obtained earlier, relates to the signifi- 
cance of the control separation factor. In 
contrast to the superiority of centrally located 
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TABLE 3 
ANALYSIS OF VARIANCE SUMMARIES or OBTAINED INFORMATION TRANSFER 


Rates (srrs/sec): Four CODING ARRANGEMENTS SEPARATELY 


—— . ˙ö——— —— 
| Control code 
Source dj LM-DC LM-DN LD-DC LD-DN 
MS | F MS F MS F MS F 
Between groups 3 0791 75 6753 | 16,98" 3435 1.96 0484 69 
Within groups 12 1049 0397 | 1754 0712 


| 
Total 15 | 


Note.—LM = limb mode; LD = limb direction; DC = digit corresponding; DN = digit noncorresponding. 


Sl. 


controls, which was demonstrated in the 
earlier study, the present experiment produced 
no evidence of significant differences in per- 
formance which could be related directly to 
this factor. The explanation of this apparent 
inconsistency may be approached by consider- 
ation of the experimental designs used in the 
two studies. The previous study employed 
only two levels of each factor, giving a 2 x 
2 X 2 design in which the control separation 
factor had only two levels, centrally located, 
and 30 in lateral separation, As one of the 
aims of the present study was to determine 
the function which described the influence of 
control separation on ITR, a 2 * 2 X 4 design 
was used in which the separation factor had 
four levels, one of zero separation, and three 
of lateral separation. The results of the pres- 
ent study replicate, to some extent, those ob- 
tained previously in that significant differences 
in performance were found between some 


groups operating with central control place- 
ment and those operating with controls later- 
ally separated (Figure 4). Further increases 
in the extent of this displacement had no sig- 
nificant effect upon performance, however. 
With the display-control relationships used 
in this study, the effect of control placement 
upon ITR was found to be of significance for 
some limb-digit codes, but not for others. 
Where the limb-digit code was affected by this 
factor, the significant displacement was from 
a central location to that where the button 
pairs were separated by 6 in. 

As stated earlier, the aim of this study was 
to examine in detail the nature of the inter- 
actions previously observed between the three 
response variables examined. Although the 
question of which, if any, of the display- 
control arrangements was the “best” was not 
Considered, it is felt that some comments re- 
lated to this issue would not be inappropriate. 


TABLE 4 


ANALYSIS OF VARIANCE SUMMARIES OF OBTAINED INFORMATION TRANSFER 
Rates (Brrs/sEC): Four CONTROL SEPARATIONS SEPARATELY 


Control separations 


Source df 0 in. 6 in. 12 in. 24 in 
MS F MS F MS F MS 1a 
Between groups 3 4854 10.65** 15342 5.54* 4041 5.92* 2577 1.43 
Within groups 12 0455 0964 0682 1800 


Total 15 


5 05. 
* 01. 


In the course of analyzing the data from the 
resent experiment, Duncan's New multiple- 
range Test (Edwards, 1963) was applied to 

means of the 16 treatment groups. It was 
pund that the upper eight values plotted in 

are 4 formed a subset of means which did 
not differ significantly from each other at the 
05 level. It is of interest to note that the 
treatment groups concerned represent the fol- 
Towing coding arrangements: limb-direction— 


corresponding = 0 separation. This represents 
considerable variety of display-control rela- 
“tionships, and gives emphasis to the concept 
bility which is often quoted as an attri- 
bute of major importance of the human com- 


‘The results of this 
cord with previous studies 
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relationships signals. Human Fec- 
tors, 1967, 9, $. 
Woven, B. J. Statistical principle: in experimental 


design. New York: McGraw-Hill, 1962. 
(Received October 9, 1967) 
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EVOLUTIONARY PERFORMANCE, MANAGERIAL 
ABILITIES, AND CHANGE: 


AN EXPLORATORY INVESTIGATION OF ORGANIZATIONS 


ERIC P. SANDERS* 
San Diego State College 


each & (N = 150), data were collected from the Ghiselli Self-Description In- 
ventory, the Gough Adjective Check List, and the performance variable “num- 
ber of cars per train.” The main measurements used were means, standard 


that certain properties of an organiza- 
formance through the evolution of the 
a task and functional nature continually 


decrease from the inception to the completion of an organization. 


Over the past few years there has been a 
change in the types of problems which have 
drawn the attention of investigators in in- 
dustrial psychology. One of the new areas 
which has commanded interest may be termed 
“organizational psychology.” As Schein (1965) 
has indicated, this new field emphasizes the 
“recognition that organizations are complex 
systems . . . and human behavior within the 
organization has to be viewed from the per- 
spective of the entire. . . system Ip. 3].” 

The present problem is concerned not with 
the study of individuals comprising groups, 
but rather with an examination of the total 
organization’s behavior as a complex entity, 
where the interest is in the exploration of 
changes in the organization’s dynamic “life- 
Span.” 

The term “organization” as used in this 
study will be defined as a social group where 
the individuals within it are differentiated 
with respect to their function in connection 
with a task of achieving some common goal 
(after Stogdill, 1950). 

Organizational evolution is defined as the 
maturation of the organization from its in- 
ception to its completion. Ghiselli (1966) and 
Ghiselli and Lodahl (1958) have studied 
Properties of groups in relation to perform- 

A study done in partial fulfillment of the re- 
quirements for a doctoral degree at the University of 
The author wishes to thank 
Edwin E. Ghiselli, University of California, Berkeley, 
for his judicious help and constructive criticism 
which helped make this study possible. 


ance, and some work has been done in view- 
ing organizational properties (e.g., Chiselli, 
1963; Smelser, 1961). However, the relation- 
ship between organizational properties and 
performance over time has not been investi- 
gated and is the concern of the present study. 

The other part of this investigator’s con- 
sideration will be focused upon “organiza- 
tional stability.” As the organization develops 
it tends to become fixed in its procedures. 
This is implicitly stated in Weber’s (1946) 
classic bureaucratic theory and other theories 
in small group research, but it has not, to 
this researcher’s knowledge, been examined 
in an experimental setting. 

The two major problems, therefore, upon 
which attention is focused in this investiga- 
tion are: 

(a) organizational evolution and the prop- 
erties of the organization’s membership which 
affect performance during evolution, and 

(b) organizational stability—whether or 
not an organization maintains its original 
functions or whether they are changed and 
modified by the members over time. 


METHOD 


In order to examine the problems raised above, a 
complex model railroad task was developed. This 
type of apparatus, but on a much simpler scale, was 
used before by Ghiselli and Lodahl (1958), Ghiselli 
(1966), and Smelser (1961) for studying group per- 
formance. The railroad is a useful task for the 
Present purpose because it is inherently interesting 
and motivating for the young men used as Ss. 

Fifty three-man organizations worked on the actual 
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possess certain manageria! abilities. The 
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the Ghiselli SDI relevant to this investigation were 
titled: Intelligence, Initiative, Ability, 
Perceived Occupational Level, Ap 
proach, and Maturity. The Gough ACL is used to 


i 


describe certain psychological attributes of a 
used Achievement, Domi- 


The reliability of performance was 

the odd-even correlation As a measure of 
“efficiency” of performance (rather than 
amount), “cars per train” was used. ‘The uncorrected 
reliability of this measure was 70. 

In order to ascertain whether or not changes took 
place, specific job changes and changes in functional 
procedures were recorded. 


RESULTS 


The data indicate that as organizations 
evolve performance improves considerably. 
The mean number of “cars per train” (ef- 
ficiency) increased from 3.80 to 4.40 from 
Stage I to Stage III, 
ence being significant at the. Ol level. 

Although one might expect that, as the in- 
telligence and initiative of the organization 
members, on the average, increase, the better 
would be the performance at the inception of 


ABILITIES AND PERFORMANCE 
INITIAL AND FINAL. STAGES OF ORGANIZA- 


TION EvOLUTION 


Average intelligence 
Average initiative 
Average managerial ability 


*p=.05. 


Stages | Lew DMA | High DMA 
I “s mw | 9 
m “wo so | ur 
oe = 


the and 

evolution. The data indicate that fewer man- 

agerial abilities are manifested, on the aver- 
membership as per- 


If there were one person who was more 
highly differentiated with regard to his de- 
cision-making approach (DMA) than the 
other members of the organization, the better 


in the differences in 

the final evolutionary stage (Stage III). 

quite significantly estab- 
Table 3, that there is 
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greater stability (fewer organization changes) 
at the later stages of organization evolution 
than at the earlier stages. The number of 
changes dropped off some 80% from the first 
to the last stage. 


DISCUSSION 


Generally speaking, performance through- 
out the “life” of the organization continually 
improves. This supports the results found by 
Ghiselli (1966) in that the “learning” of total 
organizations appears to follow a similar pat- 
tern to that of individuals—practice improv- 
ing performance with the improvement shown 
in the characteristic negative acceleration. 
Since this “learning” or performance was 
measured in terms of a single organizational 
score (number of cars per train), rather than 
just the sum of aggregate member scores, it 
can be termed organizational learning. 

The mean intelligence level of the groups 
forming the organizations is unrelated to per- 
formance whether early or later in the or- 
ganization’s history. This agrees with Ghiselli’s 
(1966) results which indicate that organiza- 
tion performance is not necessarily related to 
the mean scores for the organization. Since 
intelligence is defined here as being a wide 
domain of mental abilities rather than any 
particular traits, it is possible that perform- 
ance during the initial stages of organizational 
evolution may not be related to the mean 
intelligence of the organization members, but 
may be accounted for only by those certain 
properties associated with particular behavior 
at this evolutionary stage (e.g., ability to deal 
with ambiguity, etc.), which may be masked 
by using a mean score measure, the implica- 
tion being that coordination among members 
of an organization may be impeded in co- 
operative situations by good problem solvers 
(implied high intelligence ). 

Initiative, on the other hand, presents an- 
other picture. Initiative is defined by Ghiselli 
(1956) as “involving the beginnings of ac- 
tions and the capacity to note or discover 
new means of goal achievement [pi 311. 
This implies acting independently of others 
and the ability to develop some sort of struc- 
ture from a nonstructured situation. If an 
organization has members who score higher, 
on the average, on initiative, the better will 


be the performance during the initial stages 
since these individuals will be better able to 
structure the situation. Such appears to be 
the case with the groups studied. 

It seems that if an organization differen- 
tiates a single member who is a strong deci- 
sion maker (high positive skewness), while 
the remainder of the members are relatively 
homogeneous, then the later performance of 
that organization tends to improve. Since there 
is a relationship between those members who 
score high on the DMA skewness scale and 
high in initiative (r= 24, p> 10) and 
since initative is significantly related to cer- 
tain managerial abilities (e.g., supervisory 
ability, 1 = .48; perceived occupational level, 
fip = .59; achievement, ia = .39; and domi- 
nance, 7i = .45; all significant at the .01 
level), one would expect that an uncontested 
strong decision maker in the organization 
would not only be able to influence other 
member’s performance but also, resulting from 
his self-confidence and strong initiative, main- 
tain its upward direction through the organiza- 
tion’s evolution. Generalizations to all types 
of organizations, however, must be made with 
caution since the results are based on loosely 
structured task groups which allow flexibility 
of interaction as well as complete freedom in 
sharing in the decision-making process. With 
a strong hierarchically structured “formal or- 
ganization,” these results might show a differ- 
ent type of relationship. However, in this 
situation, it is evident that with a highly 
differentiated decision maker, the organiza- 
tion will perform better later. These results 
coincide with some of the classic bureaucratic 
theories of Weber (1946) and the studies by 
Ghiselli and Lodahl (1958) and Ghiselli 
(1966). 

The classic bureaucratic assumption regard- 
ing organization stability is tenable, accord- 
ing to these data. This indicates that at the 
inception of an organization there tends to be 
an unstructured, ambiguous situation wherein 
the members are just trying to understand 
their particular tasks, trying to fit their tasks 
into the total pattern of organizational func- 
tioning, and looking for some unifying direc- 
tion. Resulting from this situation, more 
changes occur in the tasks, procedures, and 
structure during the early stages of evolu- 


tion, which is to say that the stability of 
organizations increases throughout organiza- 
tional evolution. 
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TESTING TRADITIONAL AND TWO-FACTOR HYPOTHESES 


CONCERNING JOB SATISFACTION 


GEORGE B. GRAEN* 


University of Illinois 


Traditional and 2-factor theories make contradicting hypotheses concerning 
the relationships between satisfaction with job factors and overall job satisfac- 
tion. Traditional theory predicts essentially linear relationships, whereas 
2-factor theory predicts particular nonlinear relationships. Ss were 167 male 
and 152 female office workers employed by a single corporation. Data collected 
included satisfaction with work itself, promotion, and pay as measured by 
the scales of the Job Description Index and overall job satisfaction as measured 
by the General Motors Faces Scale. Data were analyzed by analysis of 
variance. Results indicated not only that 2-factor predictions were not con- 
firmed on 4 out of 6 relationships but also that 3 of these 4 misses were 


opposite that predicted. Traditional predictions generally were supported. 


The two-factor theory of job satisfaction 
(Herzberg, 1966; Herzberg, Mausner, & 
Snyderman, 1959) hypothesizes two sets of 
job variables—job-content or “satisfier” vari- 
ables and job-context or “dissatisfier” vari- 
ables. Moreover, these two sets of variables 
(job-content and job-context) are assumed to 
contribute to overall job satisfaction in dif- 
ferent manners. Job-content variables are as- 
sumed to account primarily for variance on 
the satisfaction portion of the continuum of 
job satisfaction-dissatisfaction. According to 
two-factor theory, when the positive aspects 
of job-content variables, such as work itself 
and advancement, are present on the job, 
they can increase job satisfaction from in- 
difference to lust for the job. However, when 
the negative aspects of these variables are 
present, they can do little or nothing to pro- 
mote dissatisfaction. In contrast, when the 
positive aspects of job-context variables, such 
as salary, are present, they cannot, according 
to the theory, increase satisfaction much 
above indifference. However, when their nega- 
tive counterparts are present, they can in- 
crease dissatisfaction from indifference to 
scorn for the job. Thus, two-factor theory 
predicts nonlinear relationships between satis- 
faction with job variables and overall job 
satisfaction. 

As an alternative formulation, the tradi- 
tional theory of job satisfaction (Ewen, Smith, 

+The author wishes to thank Marvin D. Dunnette 


and Charles L. Hulin for their cooperation and sup- 
port, 


Hulin, & Locke, 1966; Graen, 1966) hypothe- 
sizes that if job variables are categorized ac- 
cording to the two-factor classification, both 
job-content and job-context variables account 
for variance over the entire satisfaction—dis- 
satisfaction continuum. Moreover, job-content 
variables are more potent contributors to sat- 
isfaction-dissatisfaction than are job-context 
variables. Traditional theory hypothesizes 
that if the presence of a variable contributes 
to job satisfaction, the absence of that vari- 
able will contribute to dissatisfaction and 
vice versa. Thus, traditional theory predicts 
essentially linear relationships between satis- 
faction with job variables and overall job 
satisfaction. Viewed in this way, two-factor 
and traditional theories specify different func- 
tions between job variables and overall job 
satisfaction. Two-factor theory predicts non- 
linear relationships; traditional theory pre- 
dicts essentially linear relationships. 

Two recent articles (Ewen et al., 1966; 
Graen, 1966) have presented evidence indi- 
cating the two-factor hypotheses do not hold 
for work itself, promotion, or pay. These in- 
vestigators found that dissatisfaction with the 
job-content variables (work itself and promo- 
tion) contributed to overall dissatisfaction 
and that satisfaction with the job-context 
variable (pay) contributed to overall satis- 
faction. Taken as a whole, these results clearly 
supported traditional theory at the expense 
of two-factor theory. 

These results of Ewen and Graen using 
male industrial workers so clearly contradicted 
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two-factor predictions that further research 
is needed to determine their generality. The 
purpose of the present study was to test the 
generality of the Ewen and Graen findings 
by applying their methods to samples of male 
and female office workers. As in the Ewen 
and Graen studies, two major hypotheses 
were tested on which two-factor and tradi- 
tional theories conflict. These hypotheses are 
stated below. 

Hypothesis 1. If one group of employees is 
dissatisfied, a second group neutral, and a 
third group satisfied with regard to a job- 
content (satisfier) variable, two-factor theory 
predicts equal overall satisfaction for the first 
and second groups and higher satisfaction 
than the other two groups for the third. In 
contrast, traditional theory predicts an in- 
creasing linear trend in satisfaction from 
the first to the third group. 

Hypothesis 2. If one group of employees is 
dissatisfied, a second group neutral, and a 
third group satisfied with regard to a job- 
context (dissatisfier) variable, two-factor the- 
ory predicts equal overall satisfaction for the 
second and third groups and lower satisfaction 
than the other two groups for the first. In 
contrast, traditional theory again predicts an 
increasing linear trend in satisfaction from 
the first to the third group. 

In addition to these two major hypotheses, 
the Ewen and Graen studies generated two 
other hypotheses that will be tested in the 
present study. These two hypotheses are 
stated below. 

Hypothesis 3. Job-content variables are 
more strongly related to overall satisfaction- 
dissatisfaction than are job-context variables. 

Hypothesis 4. The relationships between 
job-context variables and overall satisfaction- 
dissatisfaction depend on the level of satisfac- 
tion with the job-content variables. 


METHOD 


The Ss were home-office employees of one Cor- 
poration. All employees were surveyed as part of a 
job-attitudes study conducted by the company. A 
mail questionnaire assuring anonymity to the re- 
spondents was administered through departmental 
representatives. A gratifyingly high rate (837%) of 
return was obtained and all questionnaires were 
usable. The returns included 167 questionnaires from 
males and 152 from females. 
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Among other instruments included in the ques- 
tionnaire were the Job Description Index (CDI) and 
the General Motors Faces scale. The JDI developed 
at Cornell is an adjective checklist dealing with five 
job variables, namely, work itself, promotion, pay, 
supervision, and co-workers. The JDI has been 
shown to acceptable convergent and dis- 
criminant validity (Hulin, Smith, Kendall, & Locke, 
1963; Kendall, Smith, Hulin, & Locke, 1963; Locke, 
Smith, Hulin, & Kendall, 1963; Locke, Smith, 
Kendall, Hulin, & Miller, 1964). This study employed 
only the work itself, promotion, and pay scales of 
the JDI. The General Motors Faces Scale (Kunin, 
1955) was used as the measure of overall job satis- 
faction. For details of the data-collection procedure, 
see Hulin and Smith (1967). 

In the analyses of Ewen and Graen, the variables 
of work itself and promotion were analyzed as a 
composite variable. In the present study, work itself 
and promotion were analyzed separately. Analyses 
employed a weighted means, two-way analysis of 
variance to test the relationships between the JDI 
variables of ‘work itself, promotion, and pay and 
overall job satisfaction measured by the 
Motors Faces Scale, These analyses were performed 
separately for males and females. Each of the three 
JDI variables (work itself, promotion, and pay) was 
trichotomized into dissatisfied, neutral, and satisfied 
with regard to the job variable. Cutting points on 
these JDI variables were supplied by Ewen (1965). 

In all ANOVAs, the first dimension was a job- con- 
tent variable (either work itself or promotion), and 
the second was the job-context variable (pay). Each 
namely, Gaume 


by the General Motors Faces Scale. In addition, a 
priori contrasts were tested on the main effects 
(Winer, 1962). For each dimension, one contrast 
(Al and B:) tested the difference on overall satis- 
faction between the group dissatisfied and the group 
neutral with regard to th 
contrast (Ae and Bs) tested the difference on overall 
satisfaction between the group neutral and the group 
satisfied with regard to the JDI dimension. Omega- 
squared (Hays, 1963) was employed to measure the 
strength of relationships between JDI dimensions and 
overall job satisfaction. Finally, the trend in means 
on overall satisfaction for groups dissatisfied, neutral, 
and satisfied with regard to each JDI variable was 
tested for curvilinearity (McNemar, 1962). 


RESULTS 


Results of the ANOVAs for male employees 
are shown in Tables 1 and 2. According to 
Tables 1 and 2, work itself was related to 
high and not low overall satisfaction; promo- 
tion was related to low and not high satisfac- 
tion; pay was related to high and not low 
satisfaction. Thus, work itself was predicted 
and promotion was not predicted in Hypothe- 
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TABLE 1 


ANALYsIS OF VARIANCE FOR A PRIORI CONTRASTS AND 
Omeca-SguaRED FOR STANDARD EFFECTS OF THE 
Satisrrer (WORK) AND THE DIssATISFIER 
(Pay) DIMENSIONS ON OVERALL JOB 
SATISFACTION FOR MALES 


Source of variation | df | MS F Omega? 

Satisfier (work) 2 | 31.88 | 13.23** 12 

Al: Sp versus Sx 1 | 6.89} 2.86 

Az?Sy versus Ss 1 | 23.04] 9.56** 
Dissatisfier (pay) 28.70 3.61* .03 

Bi: Dp versus Dx 1 10} 04 

B:: Dy versus Ds 1411.06] 4.59“ 
Interaction 4] 5.58 2.23 03 
Error 158 | 2.41 


Note.—S and D represent the satisfier and dissatisfier 
factors, respectively. Subscripts D, N, and S represent the 
dissatisfied, neutral, and satisfied levels, respectively. 

29 8 .05. 

p< 01. 


sis 1 by the two-factor theory. Moreover, 
promotion not only failed to support this 
two-factor hypothesis, but actually demon- 
strated a relationship the reverse of that 
predicted. Two-factor theory predicted, in 
Hypothesis 1, that content variables should 
show significant differences, if any, between 
neutral and satisfied groups and not between 
dissatisfied and neutral groups. Although this 
was the case for work itself, the opposite was 
the case for promotion. In Hypothesis 2, two- 
factor theory predicted that context variables 


TABLE 2 


ANALYSIS OF VARIANCE FOR A Priori CONTRASTS AND 
Omeca-SquareD FOR STANDARD EFFECTS OF THE 
SATISFIER (PROMOTION) AND THE Dris- 
SATISFIER (PAy) DIMENSIONS ON 
OVERALL JOB SATISFACTION 


FOR MALES 
Source of variation | df | MS F Omega? 
Satisfier (promotion) 2 | 33.08 | 13.48** 12 
Al: Sp versus Sy 112.38 5.05 
Az: Sx versus Sg 1| 604] 2.47 
Dissatisfier (pay) 2 13.94 5.68** 05 
Bi: Dp versus Dx 104 
Be: Dx versus Dg 1 9.08 3.71 
Interaction 43.16 1,29 01 
Error 158 | 2.45 


Note.—S and D represent the satisfier and di i 
factors, respectively. Subscripts D, N, and S ica an 
Bee eee and satisfied levels, respectively, 

5 = 101, 
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should show significant differences, if any, be- 
tween dissatisfied and neutral groups and not 
between neutral and satisfied groups. In con- 
tradiction to this hypothesis, pay showed a 
relationship opposite that predicted by two- 
factor theory. Overall, these results for male 
employees confirm two-factor predictions for 
only a single content variable (work itself) 
and strongly disconfirm two-factor predictions 
for both the second content variable (promo- 
tion) and the context variable (pay). 

Results of the ANOVAs for female em- 
ployees are presented in Tables 3 and 4. As 
shown in Tables 3 and 4, work itself was re- 
lated to both high and low overall satisfac- 
tion, and both promotion and pay were re- 
lated to high but not low satisfaction. In the 
case of females, work itself was not predicted 
and promotion was predicted in Hypothesis 1 
by two-factor theory. In agreement with the 
results for males, pay showed a relationship 
opposite that predicted in Hypothesis 2 by 
two-factor theory. These results for females 
were similar to those for males with the ex- 
ception that promotion was predicted and 
work itself was not predicted. Therefore, two- 
factor predictions fared no better in either 
sample: Hypothesis 1 was supported by one 
variable and not supported by the other vari- 
able; Hypothesis 2 was not supported. 

Results on the predictions of traditional 
theory are shown in Table 5. According to 
Table 5, the means on overall satisfaction 


TABLE 3 


ANALYSIS or VARIANCE FOR A Prrort CONTRASTS AND 
OmEGA-SQUARED FOR STANDARD EFFECTS OF THE 
SaTisrieR (Work) AND THE Dissatsrier (PAY) 
DIMENSIONS ON OVERALL JoB SATISFACTION 
FOR FEMALES 


Source of variation | df | MS i Omega’ 
— 
Satisfier (work) 2 91.50 34.81" | .27 
Al: Sp versus Sy 1 | 58.06 | 22.08* 
Ao: Sw versus Sg 1 | 25.05} 9.52* 
Dissatisfier (pay) 29 78 7.52" .05 
BI: Dp versus Dy 1 67 25 
By: Dy versus Dg 1 | 31.57 12.00“ 
Interaction 4 1.38 52 00 
Error 143 2.63 


Note. —8 and D represent the satisfier and dissatisfier 
factors, respectively. Subscripts D, N, and S represent 
epee neutral, and satisfied levels, respectively. 

p< 01, 


i 


Jos SATISFACTION 369 


TABLE 4 


ANALYSIS OF VARIANCE FOR A PRIORI CONTRASTS AND 
Oweca-SQUARED FOR STANDARD Errects OF THE 
SATISPIER (PROMOTION) AND THE DIssATISFIER 
(Pay) DIMENSIONS ON OVERALL Jon 
SATISFACTION FOR FEMALES 


Source of variation | df | MS | F | Omega? 
2 | 36.20 | 10.73** 10 
1| 4.00 1.19 
| 1 | 20.91 6.20˙ 
Dissatisfier (pay) 2 29.50 8.74“ 0 
Bi: Dp versus Dx 1 45 13 
Bi: Dx versus Ds 1 33.36 9.90** 
Interaction 4 | 2.34 69 .00 
Error 143 | 3.37 


Note.—S and D N the satisfier and dissatisfier 
factors, respectively: Subscripts D, N. and S represent the dis- 
pee neutral, and satisfied levels, respectively. 


* S 01. 


demonstrated an increasing monotonic trend 
for groups dissatisfied, neutral, and satisfied 
with the job variables. This relationship, pre- 
dicted by traditional theory, was shown in 
both samples by work itself, promotion, and 
pay. Of the six relationships tested for curvi- 
linearity, only that between work itself and 
satisfaction for females was significantly curvi- 
linear, However, the trend in means on satis- 
faction for this single curvilinear relationship 
was opposite that predicted by two-factor 
theory. Although two-factor theory predicted 
no difference in satisfaction between the 
groups dissatisfied and neutral with regard to 
work itself, the largest difference was between 
these very groups. Thus, predictions made by 
traditional theory in Hypotheses 1 and 2 were 
supported by these data. Moreover, these data 
clearly are not compatible with two-factor 
predictions of nonlinear relationships be- 
tween satisfaction with job variables and 
overall satisfaction. 

The results relevant to Hypothesis 3 are 
the measures of strength of association 
(Omega-squared ). These results strongly sup- 
port the hypothesis that content variables 
(work itself and promotion) are more highly 
related to overall satisfaction—dissatisfaction 
than context variables (pay). In the male 
sample, work itself and promotion accounted 
for 12% of the total variance in overall satis- 
faction, and pay accounted for 3% and 5%. 


TABLE 5 


MEANS on Ovesatt Jon SATISFACTION FOR THE 
Groups Dissatisrizp, NEUTRAL, AND 
Sarisyrep witu REGARD TO Wok 
IrseLr, Pay, AND PROMOTIO! 


J | Work | Pay _ | Promotion 
Group | — — 
n M n | M | n | M 
Males E e Al 1 
Dissatisfied 12 6.17 | 21| 7.19 4 7.14 
Neutral 18 7.17 % 7.79 | 33 | 812 
Satisfied 137 be | 117 | 8.43 | 90 | 868 
Females | | | } 
Dissatisfied | 19 5.21“ 18 | 689 | 61 | 7.36 
Neutral 26 | 7.73 | 22 | 709 | 27 | 8.11 
Satisfied | 107 | 8.89 | 112 | 8.67 | 64 | 9.11 


ee AIT 


Note.—Standard deviations on overall satisfaction were 1.73 
and 2.07 for males and females, e 
* Significantly nonlinear at $ < 01. 


Similarly, in the female sample, work itself 
accounted for 279% of the total variance in 
satisfaction, promotion accounted for 10%, 
and pay for 5% and 8%. Further evidence of 
the large differences in strength of relation- 
ship can be seen in the differences between 
means shown in Table 5. 

Results on Hypothesis 4 are the ANOVA 
tests on interactions. According to Tables 1 
through 4, none of the interactions between 
content and context dimensions was signifi- 
cant. Therefore, the hypothesis that the func- 
tioning of context variables depends on the 
level of satisfaction with the content variables 
was not supported by these data. 


Discussion 


The results of this study, taken as a whole, 
must be viewed as damaging to the two-factor 
theory. Although failure by two-factor theory 
to predict four out of six relationships seems 
discouraging enough, three of these four 
misses actually were opposite that predicted. 
In addition, the trends of the means on over- 
all satisfaction for the groups dissatisfied, 
neutral, and satisfied with the job variables 
were increasing monotonic and essentially 
linear, rather than nonlinear as predicted 
by two-factor theory. In contrast, hypotheses 
of traditional theory generally were supported 
in this study. The prediction of increasing 
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linear trends in satisfaction for groups dis- 
satisfied, neutral, and satisfied with regard 
to job variables was supported for both the 
content variables of work itself and promo- 
tion and the context variable of pay. These 
findings held for both males and females with 
a single exception. 

This study also supports the conclusions 
of Friedlander (1964), Wernimont (1966), 
Ewen et al. (1966), and Graen (1966) that 
content (intrinsic) variables are more strongly 
related to overall satisfaction—dissatisfaction 
than are context (extrinsic) variables. In the 
previous study by Graen (1966) the content 
variable accounted for 18% and the context 
variable 2% of the total variance in overall 
satisfaction. In the present study, the content 
variables accounted for between 10% and 
27% and the context variable accounted for 
between 3% and 8%. 

In agreement with Graen (1966), this study 
failed to yield a significant interaction be- 
tween content and context variables. Thus, the 
hypothesis by Ewen et al. (1966) that the 
functioning of extrinsic variables depends on 
the level of satisfaction with the intrinsic 
variables finds no support in this study. 

One major difference in results appeared be- 
tween this replication study and the original 
studies by Ewen and Graen. The sample em- 
ployed by Ewen and Graen consisted of male 
employees selected to represent the popula- 
tion of industrial concerns employing 50 
or more persons (Kendall, 1963). In those 
studies, the content variable (a composite of 
work itself and promotion) and the context 
variable (pay) contributed significantly to 
both high and low overall satisfaction. In the 
male sample of the present study, work itself 
contributed only to high satisfaction, promo- 
tion only to low satisfaction, and pay only to 
high satisfaction. The main difference be- 
tween these two sets of results was that pay 
failed to contribute to low satisfaction in the 
present study. This difference probably was 
due to the situational variables associated 
with a single company. 

Even within the same company, this study 
found large sex differences. In the sample of 
female employees, work itself contributed to 
both high and low overall satisfaction, and 
promotion and pay only to high satisfaction. 
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In contrast, for males working in the same 
location, promotion contributed only to low 
satisfaction, and work itself and pay only to 
high satisfaction. 

This study illustrates once again that the 
major hypotheses of the two-factor theory, in 
their existing form, cannot predict results pro- 
duced by the present method. Although two- 
factor hypotheses summarize the results pro- 
duced by the story-telling method very well, 
they appear useless in predicting results of 
studies employing other methods. For docu- 
mentation on this point, see Dunnette, Camp- 
bell, and Hakel (1967). Unless the two-factor 
theory can be modified so as to account for 
the disconfirming results obtained when dif- 
ferent methods have been employed it should 
be regarded as of limited usefulness in the 
attempt to understand work motivation. 
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Proposes a new emphasis in the prediction of future job behavior. The essence 
of the suggested procedure is the establishment of consistencies between relevant 
dimensions of job-behavior and preemployment-behavior samples obtained from 
real or simulated situations. If samples instead of signs are employed, a number 
of prediction and measurement problems seem to be alleviated or at least 
confronted more directly. An emerging technology of behavior sampling and 
measurement would appear to put these goals within reach. 


Many writers (e.g, Dunnette, 1963; 
Ghiselli & Haire, 1960; Guion, 1965; Wallace, 
1965) have expressed concern about the dif- 
ficulties encountered in trying to predict job 
performance, and in establishing the validity 
of tests for this purpose. In general, their mis- 
givings center around the low validities ob- 
tained and misapplications of the so-called 
“classic validity model.” To help ameliorate 
these difficulties it is proposed here that the 
concept of validity be altered as it is now 
applied to predictive and concurrent situations 
and introduce the notion of “behavioral con- 
sistency.” By consistency of behavior is meant 
little more than that familiar bit of conven- 
tional wisdom, “The best indicator of future 
performance is past performance.” Surpris- 
ingly few data seem to exist to either support 
or refute this generalization. It deserves con- 
siderably more attention. 


Some History 


It is perhaps not too difficult to trace the 
steps by which applied psychologists arrived 
at their present situation. During both World 
War I and World War II general intelligence 
and aptitude tests were effectively applied to 
military personnel problems. Largely as the 
result of these successes, the techniques de- 
veloped in the armed services were trans- 
ported to the industrial situation and applied 
to the personnel problems of the business or- 
ganization. From a concentration on global 
measures of mental ability, validation efforts 
branched out to include measures of specific 
aptitudes, interests, and personality dimen- 
sions. The process is perhaps most clearly il- 
lustrated by the efforts of the United States 
Employment Service to validate the General 


Aptitude Test Battery across a wide range 
of jobs and occupations. In general, testing 
seemed to be a quick, economical, and easy 
way of obtaining useful information which 
removed the necessity for putting an individ- 
ual on the job and observing his performance 
over a trial period. 

It was in the context of the above efforts 
that an unfortunate marriage occurred, 
namely, the union of the classic validity 
model with the use of tests as signs, or in- 
dicators, of predispositions to behave in cer- 
tain ways (Cronbach, 1960, p. 457), rather 
than as samples of the characteristic behavior 
of individuals. An all too frequent procedure 
was to feed as many signs as possible into 
the classic validity framework in hopes that 
the model itself would somehow uncover 
something useful. The argument here is that 
it will be much more fruitful to focus on 
meaningful samples of behavior, rather than 
signs of predispositions, as predictors of later 
performance. 


THE CONSISTENCY MODEL 


To further illustrate the point, consider a 
hypothetical prediction situation in which the 
following five measures are available: 


1. Scores on a mental ability test; 

2. School grade-point average (GPA); 
3. Job-performance criterion at Time 1; 
4. Job-performance criterion at Time 2; 
5. Job-performance criterion at Time 3. 


Obviously, a number of prediction oppor- 
tunities are possible. Test scores could be cor- 
related with GPA; school achievement could 
be correlated with first-year job success; oF 
the test scores and GPA could be combined 
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in some fashion and the composite used to 
predict first-, second-, or third-year job per- 
formance. All of these correlations would be 
labeled validity coefficients and all would 
conform to the classic validity model. It is 
less clear what label should be attached to 
the correlation between two different measures 
of job performance. Few would call it validity; 
many would probably refer to it as relia- 
bility. There seems to be a tendency among 
applied psychologists to withhold the term 
validity from correlations between measures 
of essentially the same behavior, even if they 
were obtained at two different points in time. 
That is, the subtleties of the concept of re- 
liability and the ingredients of the classic 
validity model seem to have ingrained the 
notion that validity is a correlation between 
a predictor and a criterion and the two should 
somehow be dissimilar. 

However, each of the 10 correlations that 
one could compute from the above situation 
represents the degree of common variation 
between the two variables, given the ap- 
propriateness of the linear correlation model. 
After all, that is what correlation is all about. 
In this sense there is no logical reason for 
saying that some of the coefficients represent 
validity and others reliability, although there 
certainly may be in other contexts. An im- 
plicit or explicit insistence on the predictor 
being “different” seems self-defeating. Rather 
one should really be trying to obtain measures 
that are as similar to the criterion or criteria 
as possible. This notion appears to be at least 
implicit in much of the work on prediction 
with biographical data where many of the 
items represent an attempt to assess previous 
achievement on similar types of activities. Be- 
havior sampling is also the basis on which 
simulation exercises are built for use in man- 
agerial assessment programs. p 

At this point it should be emphasized 
that for the consistency notion to be con- 
sistent, the measures to be predicted must 
also be measures of behavior. For example, 
it would be something less than consistent 


criteria as salary progression, 
level achieved, or subunit production. The 
individual does not always have substantial 
control over such variables, and, even with 
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the more obvious biasing influences accounted 
for, they place u ceiling on the maximum 
predictive efficiency to be expected, Further- 
more, they are several steps removed from 
actual job behavior. In this respect, the au- 
thors are very much in accord with Dunnette 
(1966) who argues strongly for the measure- 
ment of observable job behavior in terms of 
its effect on meaningful dimensions of per- 
formance effectiveness. A recently developed 
method for accomplishing this aim is the be- 
havior retranslation technique of Smith and 
Kendall (1964). The applied psychologist 
should reaffirm his mandate and return to 
the measurement of behavior. Only then will 
one learn by what means, and to what extent, 
an individual has influenced his rate of pro- 
motion, salary increases, ot work group's 
production. 

In general terms, what might the selection 
or prediction procedure look like if one tried 
to apply a consistency model? First, a com- 
prehensive study of the job would be made. 
The results of this effort would be in the 
form of dimensions of job performance well 
defined by a broad range of specific behavior 
incidents which in turn have been scaled 
with respect to their “criticalness” for effec- 
tive or ineffective performance. 

Next, a thorough search of each applicant’s 
previous work experience and educational his- 
tory would be carried out to determine if any 
of the relevant behaviors or outcomes have 
been required of him or have been exhibited 
in the past. Items and rating methods would 
be developed to facilitate judging the fre- 
quency of such behaviors, the intensity with 
which they were manifested, the similarity of 
their context to the job situation, and the 
likelihood that they will show up again. These 
judgments can then be related to similar 
judgments concerning significant and con- 
sistent aspects of an individual’s job behavior. 

Such a procedure places considerable em- 
phasis on background data and is similar in 
form to the “selection by objectives” concept 
of Odiorne and Miller (1966). However, the 
aim is to be considerably more systematic and 
to focus on job behavior and not summary 
“objectives.” 

After the analysis of background data it 
might be found that the required job be- 
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haviors have not been a part of the applicant’s 
past repertoire and it would be necessary to 
look for the likelihood of that job behavior 
in a variety of work-sample tests or simula- 
tion exercises. A number of such behavior 
measures are already being used in various 
management assessment programs. 

Finally, individual performance measures 
of psychological variables would be given 
wider use where appropriate. For example, 
the Wechsler Adult Intelligence Scale (Wechs- 
ler, 1955) might be used to assess certain 
cognitive functions. Notice that such a mea- 
sure is a step closer to actual performance 
sampling than are the usual kinds of group 
intelligence tests. 

How does the above procedure compare to 
conventional practice? The authors hope 
they are not beating at a straw man if the 
usual selection procedure is described as fol- 
lows. First, a thorough job analysis is made 
to discover the types of skills and abilities 
necessary for effective performance. This is 
similar to the consistency approach except 
that the objective seems to be to jump very 
quickly to a generalized statement of skills 
and abilities rather than remaining on the be- 
havioral level. The conventional approach next 
entails a search for possible predictors to try 
out against possible criteria. Based on knowl- 
edge of the personnel selection and individual 
differences literature, personal experience, and 
“best guesses,” some decisions are made con- 
cerning what predictors to include in the 
initial battery. It is the authors’ contention 
that the classic validity model has forced an 
undue amount of attention on test and in- 
ventory measures at this stage. Witness the 
large amount of space devoted to a discus- 
sion of “test validation” in most books deal- 
ing with the selection problem. Again, signs 
seem to take precedence over samples. Lastly, 
one or more criterion measures are chosen. 
Too often the choice seems to be made with 
little reference to the previous job analysis 
and is based on a consideration of “objec- 
tivity” and relevance to the “ultimate” cri- 
terion. Unfortunately, even a slight misuse of 
these considerations can lead to criteria which 
are poorly understood. In contrast, working 
within the framework of a consistency model 
requires consideration of dimensions of actual 
job behavior. 
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It might be added that the above charac- 
terization of the conventional approach is 
meant to be somewhat idealized. Certain de- 
partures from the ideal might reinforce the 
use of signs to an even greater extent. For 
example, there is always the clear and present 
danger that the skill requirements will be 
stated in terms of “traits” (e.g., loyalty, re- 
sourcefulness, initiative) and thus lead even 
more directly to criteria and predictors which 
are oriented toward underlying predisposi- 
tions. 


RELATIONSHIP TO OTHER ISSUES 


The consistency notion has direct relevance 
for a number of research issues that appear 
frequently in the selection and prediction 
literature. One important implication is that 
selection research should focus on individuals 
to a much greater extent than it has. That is, 
there should be more emphasis on intrain- 
dividual consistency of behavior. In their in- 
sightful discussion of the criterion problem, 
Ghiselli and Haire (1960) point out that in- 
traindividual criterion performance sometimes 
varies appreciably over time, that is, is “dy- 
namic.” They give two examples of this 
phenomenon. However, after an exhaustive 
review of the literature, Ronan and Prien 
(1966) concluded that a general answer to 
the question, “Is job performance reliabie?” 
is not really possible with present data. They 
go on to say that previous research has not 
adequately considered the relevant dimensions 
that contribute to job performance and very 
few studies have actually used the same 
criterion measure to assess performance at 
two or more points in time. In the absence of 
much knowledge concerning the stability of 
relevant job behaviors it seems a bit danger- 
ous to apply the classic validation model and 
attempt to generalize from a one-time cri- 
terion measure to an appreciable time span 
of job behavior. Utilizing the consistency no- 
tion confronts the problem directly and forces 
a consideration of what job behaviors are 
recurring contributors to effective perform- 
ance (and therefore predictable) and which 
are not. 

In addition, the adoption of signs as pre- 
dictors in the context of the classic model 
has undoubtedly been a major factor con- 
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tributing to the lack of longitudinal research. 
It makes it far too easy to rely on concurrent 
studies, and an enormous amount of effort 
has been expended in that direction, Em- 
phasis on behavior samples and behavior con- 
sistency requires that a good deal more at- 
tention be devoted to the former, along with 
very explicit consideration of the crucial 
parameters of a longitudinal study. 

The moderator or subgrouping concept also 
seems an integral part of the consistency ap- 
proach. The basic research aim is to find 
subgroups of people in a particular job family 
for whom behavior on a particular perform- 
ance dimension is consistent. Subgrouping 
may be by individual or situational charac- 
teristics but the necessity is clear and in- 
escapable. Only within such subgroups is 
longitudinal prediction possible. 

Lastly, the process the authors are ad- 
vocating demands a great deal in terms of 
being able to specify the contextual or situa- 
tional factors that influence performance. It 
is extremely important to have some knowl- 
edge of the stimulus conditions under which 
the job behavior is emitted such that a more 
precise comparison to the predictor behavior 
sample can be made. Because of present dif- 
ficulties in specifying the stimulus conditions 
in an organization (e.g., Sells, 1964), this may 
be the weakest link in the entire procedure. 
However, it is also a severe problem for any 
other prediction scheme, but is usually not 
made explicit. 

It is important to note that the authors’ 
notion of a consistency model does not rest 
on a simple deterministic philosophy and is 
not meant to preclude taking account of so- 
called “emergent” behaviors. Relative to “crea- 
tivity,” for example, the question becomes 
whether or not the individual has ever ex- 
hibited in similar contexts the particular kind 
of creative behavior under consideration. If a 
similar context never existed, the research 
must investigate creative performance and 
outputs obtained in a test situation which 
simulates the contextual limitations and re- 
quirements in the job situation. 

An additional advantage of the consistency 
approach is that a number of old or persistent 
problems fortunately appear to dissipate, or 
at least become significantly diminished. Con- 
sider the following: 
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1. Faking and response sets—Since the em- 
phasis would be on behavior samples and not 
on self-reports of attitudes, beliefs, and inter- 
ests, these kinds of response bias would seem 
to be less of a problem. 

2. Discrimination in testing—According to 
Doppelt and Bennett (1967) two general 
charges are often leveled at tests as being 
discriminatory devices: 


(a) Lack of relevance—It is charged that 
test items are often not related to the work 
required on the job for which the applicant is 
being considered, and that even where rela- 
tionships can be shown between test scores 
and job success there is no need to eliminate 
low-scoring disadvantaged people since they 
can be taught the necessary skills and knowl- 
edge in a training period after hiring. 

(b) Unfairness of content—It is further 
maintained that most existing tests, especially 
verbal measures, emphasize middle-class con- 
cepts and information and are, therefore, un- 
fair to those who have not been exposed to 
middle-class cultural and educational in- 
fluences. Consequently, the low test scores 
which are earned are not indicative of the 
“true” abilities of the disadvantaged. Predic- 
tions of job success made from such scores 
are therefore held to be inaccurate. 


The examination of past behaviors similar 
in nature to desired future behavior, along 
with their contextual ramifications, plus the 
added techniques of work samples and simula- 
tion devices encompassing desired future be- 
havior, should markedly reduce both the real 
and imagined severity of problems of unfair- 
ness in prediction. 

3. Invasion of privacy The very nature of 
the consistency approach would seem to al- 
most entirely eliminate this problem. The link 
between the preemployment or prepromotion 


‘behavior and job behavior is direct and ob- 


vious for all to see. 


CoNCLUDING COMMENTS 


The preceding discussion is meant to be 
critical of the concepts of predictive and con- 
current validity. Nothing that has been said 
here should be construed as an attack on 
construct validity, although Campbell (1960) 
has pointed out that reliability and validity 
are also frequently confused within this con- 
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cept. Neither do the authors mean to give the 
impression that a full-scale application of the 
consistency model would be without difficulty. 
Using available criteria and signs of assumed 
underlying determinants within the frame- 
work of the classic model is certainly easier; 
however, for long-term gains and the eventual 
understanding of job performance, focusing on 
the measurement of behavior would almost 
certainly pay a higher return on investment. 

Some time ago, Goodenough (1949) di- 
chotomized this distinction by referring to 
signs versus samples as indicators of future 
behavior. Between Hull’s (1928) early state- 
ment of test validities and Ghiselli’s (1966) 
more recent review, almost all research and 
development efforts have been directed at 
signs. Relatively small benefits seem to have 
resulted. In contrast, some recent research 
efforts directed at samples seem to hold out 
more promise. The AT&T studies, which used 
ratings of behavior in simulated exercises 
(Bray & Grant, 1966), and the In-basket 
studies reported by Lopez (1965) are suc- 
cessful examples of employing behavior sam- 
ples with management and administrative 
personnel. Frederiksen (1966) has reported 
considerable data contributing to the construct 
validity of the In-basket. In addition, Ghiselli 
(1966) has demonstrated that an interview 
rating based on discussion of specific aspects 
of an individual’s previous work and educa- 
tional history had reasonably high validity, 
even under very unfavorable circumstances. 
In a nonbusiness setting, Gordon (1967) 
found that a work sample yielded relatively 
high validities for predicting final selection 
into the Peace Corps and seemed to be largely 
independent of the tests that were also in- 
cluded as predictors, 


Hopefully, these first few attempts are the 


beginning of a whole new technology of be- 


havior sampling and Measurement, in both 
real and simulated situations. If this tech- 
nology can be realized and the consistencies of 
various relevant behavior dimensions mapped 
out, the selection literature can cease being 
apologetic and the prediction of performance 
will have begun to be understood. 
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EFFECT OF DEFECT RATE ON INSPECTION ACCURACY 
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80 inspectors inspected materials containing defects at 4 different rates. The 
inspection task employed had been shown in a series of previous studies to 
be representative of a variety of visual inspection tasks. Performance measures 
were obtained and analyzed to determine the relationship between defect rate 
and inspection accuracy. Findings indicated that inspection accuracy decreases 
with reductions in defect rate. The percentage of true defects detected sig- 
nificantly decreased and the percentage of false reports significantly increased. 


Although product inspections are widely 
relied upon in industrial operations to provide 
information used in the control of product 
quality, relatively little is known about the 
conditions that affect inspection accuracy. 
There are a number of factors in the inspec- 
tion task which, under certain circumstances, 
may operate to increase or decrease the ac- 
curacy of the resulting quality information. 
For example, Harris (1966) found that the 
complexity of the inspection task had a sig- 
nificant negative effect on inspection accuracy. 
Even when an unlimited amount of inspection 
time was provided, the percentage of de- 
fects detected was significantly negatively 
correlated with the complexity of the item 
inspected. 

The objective of this study was to deter- 
mine the effect of defect rate on inspection 
accuracy. Most production programs have a 
typical learning curve associated with them. 
Early in the program product defect rate is 
high, but quality typically improves, until 
later on the defect rate is relatively low. 
Other circumstances, such as variations in 
production schedules, may also cause the de- 
fect rate to vary in a predictable way. Should 
inspections be conducted in the same manner 
regardless of the defect rate? This depends, 
of course, on the relationship between defect 
rate and inspection accuracy. 

Since visual inspection tasks have some 
elements in common with visual monitoring 
tasks requiring the detection of infrequent 
signals appearing at irregular intervals, previ- 
ous research on vigilance performance may 
suggest the general nature of this relationship. 
One would predict from vigilance research 
that inspection accuracy would decrease with 


a decrease in defect rate (Buckner & Mc- 
Grath, 1963). However, there are differences 
between typical production inspection tasks 
and the monitoring tasks which have ap- 
peared in the studies on vigilance performance. 
Consequently, there is little in vigilance re- 
search results to indicate the specific nature 
of the relationship between defect rate and 
accuracy for an inspection task. 


METHOD 


A representative inspection task was developed 
and materials prepared which included defects at 
four different defect rates. Materials at each defect 
rate were inspected by a sample of naïve Ss. Inspec- 
tion accuracy was measured in terms of the per- 
centage of defects detected and the percentage of 
false reports made. 


Inspection Task 


A paper-and-pencil inspection task was developed. 
The task was similar to that employed in a previ- 
ously developed selection test for visual inspection 
work (Harris, 1964). In a series of validation 
studies, performance on this task was found to be 
significantly correlated with performance on a wide 
variety of visual inspection tasks. The correlations 
are listed in Table 1. 


TABLE 1 


CORRELATIONS BETWEEN PERFORMANCE ON Test TASK 
AND VARIOUS INSPECTION TASKS 


N Correlation 
Inspection task 1. 8. with test 

inspectors task 
Electronic chassis 11 39 
Inertial instruments 8 .86* 
Module assemblies 19 58% 
Circuit boards 27 .64* 
Microelectronic devices 10 78* 
Photomasks 15 82 


— o o o —— 


* <.05. 
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Fic. 1. Sample inspection sheet with defects indicated. 


The inspection task consisted of detecting defects 
in sheets like the one shown in Figure 1. The en- 
closed area contained 80 items: 20 each of the four 
types shown in their correct form and orientation 
at the side of the sheet. Defective items were in- 
dicated by an X made over them with a colored 
pencil. 

Four different 40-sheet sets were prepared, one 
for each of the four different defect rates. The layout 
of each sheet was identical to the one shown in 
Figure 1. The starting point in the preparation of 
materials was drafting a defect-free sheet and print- 
ing 160 copies of it. Then a basic set of 32 defective 
items was drafted and copies printed. Defects were 
inserted by gluing defective items over appropriate 
items on the defect-free sheets. From these masters, 
20 copies of each 40-sheet set were printed. 

Defects were randomly assigned to form four 40- 
sheet sets with defect rates of 0.25%, 1%, 4%, and 
16%. Since each set contained a total of 3,200 items 
(40 sheets X 80 items per sheet), these percentages 
correspond to 8, 32, 128, and 516 defects per set of 
40 sheets. In preparing the 1%, 4%, and 16% defect 
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rates, the 32 defective items were assigned once, 4 
times, and 16 times to the 40-sheet set. For the 0.25% 
defect rate, four subsets of 40 sheets were each as- 
signed 8 of the basic set of 32 defective items. In 
this manner, each defective item appeared an equal 
number of times under each defect-rate condition 
A table of random numbers was used to assign de- 
fects to sheets. 


Procedure 


Inspections were made under the four defect-rate 
conditions by a total of 80 naive inspectors. In- 
spectors were randomly assigned to the 4 conditions, 
20 per condition. The inspectors were paid student 
volunteers from California State College at Fullerton 
and Fullerton Junior College. Each inspector was 
given samples of the types of defects to be de- 
tected and instructed to X over each defect found 
on each sheet. Inspectors were encouraged to take 
as much time as necessary to perform accurate in- 
spections. They were given no instructions about the 
number of defects to expect; they were instructed, 
however, to ask for clarification on any item about 
which they were undecided. In this manner, the 
monitor performed a role similar to an inspection 
supervisor on the job. 

Inspection accuracy was measured in terms of de- 
fects detected and false reports made. For each in- 
spection condition, the number of defects detected 
was divided by the number of defects present to give 
the percentage of defects detected. The percentage of 
false reports was computed by dividing the number 
of defects reported which were not actually defects 
by the total number of defects reported. 


RESULTS AND Discussion 


Inspection accuracy decreased with reduc- 
tions in defect rate. As shown in Figure 2, 
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PERCENTAGE 
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Fic. 2. Relationship between defect rate and 
inspection accuracy, 
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the percentage of defects detected decreased 
slightly between defect rates of 16% and 1%, 
but dropped sharply between 1% and 0.25%. 
False reports, the second indicator of inspec- 
tion accuracy, became more frequent with 
reductions in defect rate. The percentage of 
false reports increased at an accelerated rate 
as the defect rate approached zero. The dif- 
ferences among the four defect rates for the 
percentages of defects detected and for the 
percentages of false reports were tested by 
means of the Kruskal-Wallis one-way analysis 
of variance by ranks (Siegel, 1956). Differ- 
ences were statistically significant beyond the 
.05 level for both the percentages of defects 
detected and percentages of false reports. 
These results demonstrate a potential prob- 
lem in using information obtained from visual 
inspections. When a relatively high quality 
product is being produced (the percentage of 
defective characteristics is about 1 or less), 
inspections may not be very accurate. The 
percentage of defects detected is likely to be 
low and the percentage of false reports is 
likely to be high. As a consequence, inferences 
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made about the true quality level from visual 
inspections may be in error. 

These findings have implications for the 
management of inspection operations. Super- 
visors should be aware that this tendency 
exists and, thus, be better prepared to counter- 
act the effect of this factor. For example, as 
quality level increases, considerations might 
be given to fewer, more intensive inspections 
made through the use of sampling plans. Much 
better information might be obtained at less 
cost, for example, from inspections of a 10% 
random sample of the product than from in- 
spections of 100% of the product. 
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PREFERENCE SCALING OF CAREERS AND ORGANIZATIONS' 
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College of Business Administration, University of Rochester 


Paired-comparison scaling procedures were applied to an investigation of 
preferences for 5 types of careers and 5 types of organizations. Scaling was 
accomplished for 10 work-satisfaction criteria, using 4 samples, 2 of college 
students and 2 of college preparatory high school students. The median correla 
tion of preferences between samples was .75; internal consistencies of 
preferences were higher for college students. The order of preferences for 
both careers and organizations depended upon the criteria. 


tional interests can be fairly well measured, 


The time-worn debate between the “busi- 
nessman” and the “humanist” regarding the 
virtues of a business career continues to ap- 
pear in the popular press. As usual, both as- 
sert that their point of view conforms to 
current student preferences (Blough, 1966; 
Drucker, 1965). This assertion is possible 
because accurate data bearing on the question 
have not been available, and traditional voca- 
tional preference theory has not addressed 
itself directly to the problem. 

For today’s college student this debate may 
be more practical than at any time in the 
past. Employment opportunities in teaching, 
government, research, and social service have 
expanded greatly since 1929 (Fuchs, 1965). 
As the first step in research on this problem, 
this study attempts to assess student prefer- 
ences for a limited number of careers, using 
10 criteria, and to also assess preferences for 
the type of organization in which these careers 
might be practiced. 

Most vocational preference theory tries to 
account for the development of “interests”; 
for example, perhaps a graduating senior 
wants to become an engineer because he 
enjoyed playing by himself with an erector 
set rather than playing doctor with the little 

girl next door. This theory often focuses on 
the biographical antecedents of measured in- 
terest patterns; for example, a current area 
of research studies the development of a 
“self” concept, an increasing awareness of 
one’s own role (Hunt, 1967). Although voca- 


1This study was supported by a grant from 
Security Trust Corporation, Rochester, New Vork. 
Parts of this paper were presented at the meeting of 
the Midwestern Psychological Association, Chicago, 
May 1967. 
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neither the theories nor empirical research 
have resulted in a clear statement of the de- 
cision process which a student undertakes as 
gtaduation nears (Holland, 1964). 

This study attempts to measure attitudes 
which may affect career decisions by deter- 
mining whether widely held preferences exist 
for careers and organization types in which 
careers may be pursued. It is hypothesized 
that, for the same criteria, preference patterns 
for business organizations may be consider- 
ably different than those for a business ad- 
ministration career. 

The issue of preferences raises the question 
of the basis of career decisions. One model 
which has been proposed by economists as- 
serts that (assuming people have appropriate 
information) career decisions are based on an 
assessment of the present value of future 
earnings of alternatives at the time the career 
decision is made (Wilkinson, 1966). The 
present value model (in its simplest form) 
states that when people are nearly indifferent 
in their estimates of nonmonetary rewards, 
such as a desire for power or leisure, they 
choose a career on the basis of higher ex- 
pected earning power. A greater difference in 
monetary rewards is needed in order for the 
decision to be swung by the monetary factor, 
the less the indifference to the nonmonetary 
factors. Empirical evidence on individual 
career decisions does not exist to support or 
deny the idea that people behave in a fashion 
similar to that proposed above. Data show 
that aggregate shifts in numbers choosing 
careers usually follow shifts in salary offers. 
However, the question as to how individuals 
actually make their decisions is not answered 


adequately since the data are purely post hoc 
for the aggregate. For present purposes it will 
be assumed that people are able to provide 
a qualitative ranking which includes responses 
intrinsic to the work activity and the mone- 
tary reward therein. 

For example, it could be argued that each 

person is able to estimate (qualitatively) 
some just noticeable difference in income. 
Within the range of this income difference, 
nonmonetary matters are used to decide be- 
tween alternative career patterns. Outside this 
range, trade offs between income and other 
criteria may be made. For some people it may 
be that possible choices, after income is taken 
into account, will be very wide. However, 
very accurate information on how these trade 
offs are made is not available. 
This study is set up to measure separately 
the preferences for careers and organizations 
on a basis of 10 criteria related to job satis- 
faction. 


METHOD 

Criteria 

Ten criteria for comparing careers and organiza- 
tions were established after cohsideration of a num- 
ber of reviews of the job-satisfaction literature 


(Bass, 1965; Herzberg, 1966; Vroom, 1964). The 
criteria are: 


(a) there are better opportunities to accomplish 
something worthwhile. 

(b) there are better advancement possibilities. 

(c) there will be more situations where you can 
apply what you learned in school. 

(d) your success would depend least on lucky 
circumstances. 

(e) you would earn the highest salary. 

(f) you would have more important responsibili- 
ties. 

(g) you would be most recognized for your con- 
tributions. 

(h) you would receive supervision from most 
competent superiors, 

(i) you would have the most secure future, with 

steady employment. 

(i) you would have the best chance of working 
with people with whom you could build last- 
ing friendships. 


The 10 criteria were added to the following pair of 
questions: 


“On your first job in which [type of organization] 
ily you think [Criteria a through I].“ 

On your first job doing which [type of job] do 
you think [Criteria a through II.“ 
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Paired-Comparison Instrument 


The following is a list of organizations and careers 
which were compared. The careers are ones which 
bright students may consider, and usually involve 
graduate school. 


CAREERS 
Business Administrator 
Lawyer 

Research Scientist 
Professor 

Clinical Psychologist 


ORGANIZATIONS 


Federal Government 
College Faculty 
Small Business 
Non-Profit Service 
Large Corporation 


The exact wording of every question asked will be 
found in Tables 1 and 2. Given these criteria and the 
above list of careers and future employers, a paper- 
and-pencil form containing a series of 20 blocks of 
10 paired-comparison items was developed. All 
possible pairs of either careers or organizations were 
included in a block. Each member of a pair was 
placed under columns designated “A” or “B,” fol- 
lowed by a blank space under a column labeled 
“choice.” The order of pairs within a block was de- 
termined by using a table of random numbers. The 
10 organizational comparison blocks were followed 
by the 10 career comparison blocks. Two sample 
pairs using items not included in the actual com- 
parisons were included in the instructions, The Ss 
were instructed to work rapidly. Most completion 
times, including response to a biographical informa- 
tion form, were between 30 and 40 min. This in- 
strument is at present being used in a number of 
validation studies. 


Subjects 


The instrument was administered to four samples 
of Ss. Two of the groups were high school seniors 
(n= 72 and 25) who have indicated their intent to 
go to college, The high schools were located in 
middle-class suburban communities. The third group 
consisted of 116 freshmen students at a medium- 
sized eastern university with a highly selective ad- 
missions policy who were enrolled in a freshman 
psychology course. A fourth sample consisted of 69 
upperclassmen at the same university enrolled in an 
experimental psychology and a statistics course. 


RESULTS 
Computation of Preference Values 


The individual preferences were evaluated 
according to a procedure suggested by Guil- 
ford (1954).? 


2A program to perform these computations writ- 
ten by W. R. Johnson for the IBM 7074 is available 
from the authors. 


wow 
* 
N 


DANIEL N. BRAUNSTEIN 


AND GEORGE H. HAINES 


TABLE 1.—PREFERENCE 


Federal Government 


| High school 


Question 


College faculty 


| University | High school Unive 


| Sample 1| Sample 
| 


a) On your first job, in which do you think | 
there are better opportunities to accom- 
plish something worthwhile? 

(b) On your first job in which do you think 
there are better advancement possi- 
bilities? 

(c) On your first job, in which do you think 
there will be more situations where you 
can apply what you learned in school? 

(d) On your first job, in which do you think 
your success would depend least on 
lucky circumstances? 

(e) On your first job, in which do you think 
you would earn the hi. t salary? 

U) On your first job, in which do you think 
you would have more important re- 
sponsibilities? 5 

(£) In your first job. in which do you think 
you would be most recognized for your 
contributions? 

(% In your first job, in which do you think 
you would receive supervision from most 
competent superiors? 

% In your first job, in which do you think 
you would have the most secure future, 
with steady employment? 

(Q) In your first job, in which do you think 
you would have the best chance of 
working with people with whom you 
could build lasting friendships? 


061 


156 


406 


—.803 


—.965 


Fresh 
men 


Upper 
classmen 


| | 


| Upper- 
classmen 


2| Fresh- 


Sample 1| Sample 2 


302 | .229 


—.593 577 


Note. — Data are Z-score values representing the average preference for each item compared to all other items. A zero (neutral) 


probability of preference, when compared to the other items. 


High school: Sample 1 n = 72; Sample 2 x = 25. University: Freshmen n = 116, Upperclassmen n = 69, 


The average proportion of preferences for 
every career and organization was calculated. 
This average proportion was converted to a 
scale on a standard score basis, with 0 as a 
neutral point. Positive scale values indicated 
a higher proportion of preferences than 33 
negative, a lower proportion. 


Preference Values for Organizations 


The data for the organizational preferences 
are given in Table 1. There are considerable 
differences between the criteria as to the type 
of organization most preferred. Federal Goy- 
ernment is generally preferred for the security 
criterion; college faculty for accomplishment, 
application of school work, and least depend- 
ence on luck; large corporation for advance- 
ment and salary; non-profit service and small 
business shared preferred positions for the 
recognition and social criteria, 


Preference Values for Careers 


The data for the career preferences are 
given in Table 2. Business administrator is 


strongly preferred for its advancement op- 
portunities. Among the high school students, 
lawyer is viewed as offering high starting 
salaries, while the college Ss’ preferences for 
lawyer are lower on the salary criterion. Re- 
search scientist is preferred for recognition 
and competent supervision and, by all except 
the college upperclassmen, for accomplish- 
ment. The upperclassmen strongly prefer clin- 
ical psychologist for the accomplishment and 
responsibility; it is not highly preferred by 
the others (the upperclassmen in the present 
sample were psychology majors). College pro- 
fessor is strongly preferred, surprisingly, on 
the social criterion. 


Internal Consistency 


If such an obtained scale of preference 
values is to be maximally useful, one must as- 
sume that the scale results can be used as if 
the so-called “strong axiom of revealed pref- 
erence” held. That is, if A is preferred to B 
and B to C, then C ought not to be preferred 
to A. If it is, this triad would be inconsistent. 


4 


| 
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à 


Large corporation 


Non-profit service organization | 


Small business 


= "i = —ͤ — — N 
High school | University Highschool | University High schoot | University 
| — — — a ia] — 
| Fresh- | U ; z Io 

* sh- | Upper- * | Fresh- | Up | : Fresh- | Upper 

ai Sample 2 men claszmen Sample 1 Sample 2 men | clas „| Sample 1 | Sample 2 men | —— 

| N x 

—.176 .203 139 | —.279 108 | —.066 %s 427 | —.205 —.020 258 —,628 

| | 
| | 
431 451 | 488 | 452 —.590 | —,696 —.802 —325 | 183 127 048 —.153 
| | | 
.000 069 —.068 —.201 —.250 —.728 256 229 107 0 | -311 —524 
| | | | 
.000 019 —.229 | —.188 —.268 —.167 .038 | 228 | —.147 | —.102 | —.296 —.488 
885 | 972 1.184 1.199 | —1.183 | —1.481 | —1.279 | —1.118 | —.005 037 002 —.014 
| 
| | | 
—.416 —.750 | —.455 —,529 —137 | —.148 131 306 164 272 287 280 
| 
-.379 —.402 —.467 —.376 305 327 324 349 285 305 33⁰ 198 
| | 
299 150 281 299 —612 | 493 | —.421 | —.252| -.338 | —.144 | —.391 —.387 
| | | 
«254 .341 223 185 —.514 | —.996 | —.535 | —.218 +315 | —.121 —.432 | —.688 
| | 
—.646 —.352 —.752 —.629 561 418 424 373 „311 476 200 | 01 


value indicates that the item is preferred half the time when compared to the others. The higher the score, the higher the 


The presence or absence of this condition 
(consistency of behavior) was tested using 
the Dobell (1965) method. 

Consistency allows construction of a mean- 
ingful scale. If this condition, which might be 
called an internal consistency condition, is 
not met, then it is not possible to infer back 
from the scale the actual choices which would 
be observed in paired-comparison choice sit- 
uations. It should be noted that a lack of in- 
ternal consistency does not necessarily imply 
unreliability—preference choices may be re- 
liably inconsistent. 

Internal consistency was greater for the 
college students than the high school students. 
Eliminating the counting of ties as “incon- 
sistent,” all organizational criteria and all but 
two career criteria (competent supervision 
15 security) were consistent for college stu- 

ents, 


Consistency between Samples 


Product-moment correlations between sam- 
ples were computed for all pairs of the four 
samples of data. 


Tables 3 and 4 give the sample intercorrela- 
tions. They range from .52 to .84; the mode 
and median is .75. The range for careers is 
greater than that for organizations. All are 
significant beyond the .01 level. 


DISCUSSION 


This article began with the concept that 
preferences both for careers and organizations 
must be measured concurrently because pref- 


TABLE 3 

CORRELATION MATRIX: ORGANIZATIONS 

Sample 1 | 2 3 | 4 
1. High school, 
Sample 2° 1.00 0.61 0.59 0.78 
2. University upper- 
classmen? 0.61 1.00 0.75 0.81 
3. University freshmen? | 0.59 0.75 1.00 0,76 
4, High school, 
Sample 14 0.78 0.81 0.76 1.00 
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TABLE 2.—PREFERENCE 


= — — — —— 
Business administrator Lawyer 
1 
Question High school University High school University 
Fresh- | Upper- Se | Fresh- | Upper 
Sample 1| Sample 2 wen | classmen | S*™Pl 1) Sample 2) men clasamen 
(a) On your first job. doing which do you | 
k there a ti unities to 
— rN —— worthwhile? * —.80 —.877 —. 162 —.069 —.072 349 
(b) On your first ron dong which do you 
t here 
— W 646 506 048, 056 428 086 
(c) On your first job, doing which do you 
think there will be more srantos 
2 TP as —A92 | —1,096 —.172 —.238 —.198 —.498 
(d) On your first job, doin; vein 8 
hi: success nd least 
on lucky circumstances? = —.100 —.179 -000 a141 | —.162 —.308 
(e) 8 yi 5 doin; hepa you 
would earn st 
F= pa 0 * 560 390 647 802 470 309 
U) On r first job, doing which do you 
think you would have more important £ 
responsibilities? —.333 | —.381 +129 001 107 —.215 
(£) In your first job, doing which do you 
think you would be most recognized for 
your contributions? —.469 | —.563 130 | —.008 117 —.022 
(h) In your first job, doing which do you 
think you would receive supervision 
from most competent superiors? —.183 | —.231 | —.085 | —.230 | —.076 | —.130 
(% In your first job, doing which do you 
think you would have the most secure 
future, with steady employment? 159 098 037 144 —.022 —.134 
Q) In your job, doing which do you 
think you would have the best chance of 
working with people with whom you 
could build lasting friendships? —.337 —.530 —.204 —.0⁵³ .046 —.224 
Note. — Data are Z-score values representing the average preference for each item compared to all other items. A zero (neutral) 


probability of preference, when compared to the other items. 


High school: Sample 1 n = 72; Sample 2 n = 25. University: Freshmen n = 116; Upperclassmen n = 69. 


erence patterns for business organizations 
may be considerably different than for busi- 
ness administration careers. The results tend 
to support the hypothesis: However, the dif- 
ferences which exist for any criterion would, 
the authors assert, be of small importance, 
if in fact the preference patterns of students 


TABLE 4 
CORRELATION MATRIX: CAREERS 


Sample 1 | 2 | 3 | 4 
1. High school, 
Sample 2° 1.00 0.52 0.59 0.84 
2. University upper- 
classmen> 0.52 1.00 0.75 0,63 


3. University freshmene | 0.59 0.74 1.00 075 
4. High school, 
Sample 14 0.84 0.63 0.75 1,00 


for careers and organizations were internally 
inconsistent. Vocational interest theory, com- 
bined with studies of the value of future in- 
come, must consider these social biases in 
order to clarify the -career-decision-making 
process, 

This study also investigated the possibility 
that there would be substantial agreement 
among students regarding preferences which 
may have an effect upon career decisions, ‘The 
four samples, two of high school students 
and two of college students, yielded prefer- 
ences which showed a high amount of agree- 
ment with each other. This agreement indi- 
cates that the preferences may have a con- 
siderable amount of stability across other 
such samples, 

The most important implication of this 
study is that neither business careers nor 
business organizations are universally down- 
graded by students in comparison to the other 
alternatives available to them, although gen- 
eral business administration careers may have 
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VALVES FOR CAREERS 
Research Clinical psychologist College professor 
High school University High school University High school University 
s t Fresh- | Upper- Fresh- | U| - * 8 
Sample 1 | Sample 2 men 8 Sample 1 | Sample 2) men cane Sample 1 | Sample 2 sm ——— 
009 563 438 418 190 27¹ .205 700 102 052 297 236 
097 oso | —334 | -.016 | —.358 | —.402 | —430 | —.219 | —.392 | -.102 | —.310 | —.358 
121 .213 207 649 —.319 | —.316 .056 675 334 403 426 384 
—.013 165 —.112 | =.023 006 | —.082 | —.007 .211 .192 | —.063 362 3⁰⁰ 
154 025 | —.102 157 | —.084 123 —.284 | —.298 | —.580 | —.791 —.620 | —.557 
031 199 | —.004 | —.100 137 ATL 120 «720 „191 .061 153 —.025 
633 575 .592 439 | —.126 | —.029 | —.147 154 | —.128 | —.165 | —.003 —.006 
360 .269 308 646 | —.238 | —.040 | —.00 | —.132 .040 062 | —.040 | —.153 
289 005 | —.323 | —.066 | —.115 | —.273 074 | —.079 268° 063 271 179 
ags | —305 | —.237 | —.280 | —.014 | —.346 010 -448 605 778 518 586 


value indicates that the item is preferred half the time when compared to the others. The, higher the score, the higher the 


difficulty in attracting these students com- 
pared to the professions for most career ob- 
jectives. The existence of commonly held 
stereotypes, such as found in this research, 
may indicate to businessmen, teachers, and 
guidance counselors that our culture may be 
transmitting some strong influences to stu- 
dents as they make their career decisions. 
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TASK COHERENCE, TRAINING TIME, AND RETENTION 
INTERVAL EFFECTS ON SKILL RETENTION’ 


JAMES C. NAYLOR,* GEORGE E. BRIGGS, ano WALTER G. REED? 
Ohio State University 


Amount of training, secondary task coherence, and length of retention interval, 
each at 2 levels, were evaluated in terms of long-term skill retention effects. 
The criterion task was composed of a 3-dimensional tracking task (primary 
task) and a 9-event monitoring task (secondary task). Retention loss varied 
inversely with amount of training and with secondary task coherence (the 
latter only under the lesser amount of training) for both tasks, Absolute re- 
tention levels varied directly with the training and task coherence variables 
and inversely with retention interval. From these and previous data, task 
coherence emerges as an important variable in skill acquisition and retention. 


In their review of the literature on skill re- 
tention, Naylor and Briggs (1961) concluded 
“that the retention of an arbitrary sequence 
of responses (whether verbal or motor) is 
less than the retention of responses which oc- 
cur in meaningful sequences or with patterned 
organizations [p. 6].” At first sight the re- 
sults of some recent tracking research by 
Trumbo, Noble, Cross, and Ulrich (1965) and 
by Noble, Trumbo, Ulrich, and Cross (1966) 
seem to be opposite to the above conclusion: 
They found greater losses in performance 
over the retention interval for more predict- 
able tasks. However, the absolute level of 
performance at retention test was better for 
the more coherent tasks, thus supporting the 
above conclusion. In a recent summary and 
review of their research, Noble and Trumbo 
(1967) suggest that the greater retention 
losses for the more coherent tasks may be due 
primarily to a deterioration in timing ac- 
curacy over the retention interval. This in- 
terpretation is consistent with a study by 
Naylor and Briggs (1963) which found re- 
hearsal on the temporal aspects of a task 
much more effective than rehearsal on the 
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spatial aspects in terms of retention perform- 
ance. Therefore, while deterioration occurs in 
both the temporal and the spatial aspects of 
learned responses, the former is the more 
crucial loss to skill. 

In a more recent study, Trumbo, Noble, 
and Swink (1967) utilized a tracking situa- 
tion as a primary task and added a secondary 
task (verbal anticipation or guessing a se- 
quence of numbers) during retention testing 
(Experiment I). Coherence was varied sys- 
tematically in both tasks, and, while primary 
task performance during retention was in- 
ferior when the subject (S) experienced the 
two tasks simultaneously, there was no differ- 
ential effect of the level of predictability of 
the secondary task on primary task perform- 
ance. As in their previous work, the authors 
found differential retention performance as a 
function of primary task coherence. 

It seems odd that secondary task coherence 
would not produce differential effects on pri- 
mary task performance: From the concept of 
a central, limited channel for information 
processing (Broadbent, 1958), one would pre- 
dict a differential influence of both primary 
and secondary task coherence provided, of 
course, that the experimenter (E) selected 
sufficiently different levels of secondary task 
predictability. In other words, it is conceiv- 
able that Trumbo et al. (1965) did not differ- 
entially load the channel in their two sec- 
ondary task conditions where the obtained 
average uncertainty in the stimulus sequence 
was .542 and 1.882 bits/signal. 

The present research examined the influence 
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of two levels of secondary task coherence on 
primary (tracking) task performance during 
both learning and retention. Both tasks were 
more complex than those utilized by Trumbo 
et al. (1967), and an attempt was made to 
assure a significant separation of secondary 
task coherence levels. However, both of the 
coherence levels used here provided S with the 
opportunity for errorless performance on the 
secondary task, in contrast to the Trumbo 
et al. (1965) research where S could im- 
prove with practice on the secondary task but 
could not achieve errorless performance ex- 
cept by chance. 

It was predicted that both acquisition and 
retention performance would vary directly 
with secondary task coherence for both the 
primary and the secondary tasks. In addi- 
tion to secondary task coherence, the amount 
of training (2 or 3 wk.) and the retention in- 
terval (1 or 4 wk.) were varied systematically 
to determine if the above prediction would 
hold across these two highly pertinent vari- 
ables in retention research. 


METHOD 
Apparatus 


The S performed two tasks simultaneously: a 
three-dimensional tracking task and a nine-event 
monitoring task. The primary (tracking) task pre- 
sented S with three center-reading voltmeter dis- 
plays which provided system error information, and 
he attempted to null out such error by manipulating 
a single control column via left-right, fore-aft, and 
rotation responses. Each of the tracking dimensions 
involved rate dynamics and the control-display gain 
was .025 ips of display pointer movement per 
degree of control device movement. The control was 
spring loaded in all three dimensions at approxi- 
mately 1 oz/deg of arc, and a single sinusoid of .6 
cpm served as the tracking input signal to all three 
dimensions of the primary task. 

The secondary (monitoring) task presented S with 
nine rows of stimulus and response devices to indicate 
nine events. Within each row there were (from left 
to right) three response buttons, plus an amber- and 
4 red-jewel light. An event occurred every 6 sec. 
If an amber light occurred, S was to press the but- 
ton labeled “OK” for that event; if a red light ap- 
peared, he first pressed the “Emergency” button 
(which extinguished the red light and activated the 
amber light), and then he was required to press the 
OK” button; and if no light occurred at the ex- 
Dected time, S pressed first a button labeled Check“ 
and this activated the red light, which in turn re- 
quired him to press first the “Emergency” and then 
the “OK” buttons, In all cases a correct sequence 
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of responses within 4 sec. of event onset resulted in 
an amber light being locked in; an incorrect sequence 
resulted in a red light being locked in 

The secondary task was located to the left of the 
tracking display and the separation was such as to 
require S to look away from the tracking display in 
order to respond, although the secondary display 
lights were of sufficient intensity to be clearly ap- 
parent in peripheral vision 

Secondary task coherence was manipulated in terms 
of the spatial sequence of events: In the highly pre- 
dictable conditions the spatial sequence was 1, 2, 3, 
4, 5, 6, 7, 8, 9, where 1 refers to the top pair of 
lights, 2 to the next pair, and so on; and for the 
low coherence condition the spatial sequence was 1, 5, 
2, 9, 8, 3, 6, 7, 4. Stated in terms of the amount of 
change within the spatial sequence of events, the 
high coherence sequence involved 0 bits of uncer- 
tainty while the low predictability sequence involved 
2.16 bits of uncertainty per change. 


Design and Procedure 


Table 1 summarizes the treatments for each of 
the eight experimental conditions. The latter re- 
sulted from factorial combinations of (a) training 
time (2 or 3 wk.), (b) retention interval (1 or 4 
wk.), and (c) monitoring task coherence (high and 
low). Each group was filled with 16 male under- 
graduates who volunteered for service and were 
paid $1.00 per session. Group assignment was made 
on the basis of total tracking task performance during 
the sixth session. 

All Ss experienced part-task training through the 
fifth session with whole-task training and retention 
testing thereafter. Except for the first session, which 
involved four 70-sec. trials on each of the two tasks, 
there were twelve 70-sec. trials per session. The 
tracking task was always practiced as a whole (three- 
dimensional) task, and there was no change in the 
primary task from the first to the last session. How- 
ever, the secondary task was increased in complexity 
across the first five sessions in terms of the number 
of amber, red, and no-light events: during the first 
two sessions the stimulus events were always red or 
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EXPERIMENTAL CONDITIONS FOR THE VARIOUS 
Retention Groups 


Amount of Task Retention 
easy ed S organization 17 
1 2 High 1 
2 2 High 4 
3 2 Low 1 
4 2 Low 4 
5 3 High 1 
6 3 High 4 
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4 
ferential losses either as a main effect or in 
interaction with either of the other two in- 
dependent variables. 

In terms of absolute retention performance 
on the tracking task, Table 3 shows that all 
three independent variables were statistically 
significant as main effects. Thus, not only was 
superior retention performance obtained un- 
der high secondary task coherence and after 
3 wk. of training (compared to that under 
low coherence and after only 2 wk of train- 
ing), respectively, but also performance after 
a Ik. retention interval was superior to 
that after 4 wk. 

In summary, then, amount of training in- 
fluenced both the amount of deterioration in 
tracking performance and the absolute per- 
formance level at retention test; secondary 
task coherence differentially influenced the 
amount of deterioration only for groups which 
received 2 wk. of original training, although 
there was an overall effect in terms of ab- 
solute performance levels at retention test; 
and, while there was no greater loss in per- 
formance over a 1-wk. retention period than 
over 4 wk., the final average performance 
level after 1 wk. was superior to that after the 
longer retention interval, 


Monitoring Task Performance 


Table 4 provides summaries of the per- 
formance levels of the eight groups during 
training (final block of four trials) and re- 
tention test. It can be seen that very few 
errors of either type occurred and, in gen- 


eral, S was more apt to commit errors (extra 
responses) than to fail to respond. Tables 2 
and 3 provide the results of analyses of vari- 
ance applied to the monitoring task scores 
during retention test. 

Omissive errors. During the final training 
trials there were significantly more omissive 
errors by Ss who experienced only 2 wk. of 
training (Groups 1-4) than by those who 
experienced 3 wk. of training (F = 5.81, df 
= 1/124); however, unlike the case with 
tracking performance data, secondary task 
coherence did not differentially affect omis- 
sive errors during training. In terms of the 
amount of deterioration over the retention in- 
terval, the omissive error data revealed the 
same pattern as with the tracking data: 
greater losses occurred in the 2-wk. training 
groups than in the 3-wk. groups, and the 
Training X Coherence interaction was sta- 
tistically significant (see Table 2). From 
Table 3 it can be seen that omissive errors 
followed the same pattern as found for track- 
ing performance, except that retention in- 
terval was not significant and the Training X 
Coherence interaction was significant at p < 
05. Again, the pattern of this interaction was 
the same as that found for the relative loss 
scores on the tracking and monitoring tasks 
(Table 2): the worst performance was 
achieved under the 2-wk. training, low co- 
herence conditions (Groups 3 and 4) with 
fairly comparable performance for all other 
groups. 

Comissive errors and response time. These 
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measures of monitoring task performance 
less sensitive to the effects of the inde- 
g nt variables than either the omissive 
error or the tracking error scores, For ex- 
there were no signiñcant differences 
; either set of scores during training and 
there was only one significant F ratio from 
the two retention analyses of the 
‘time scores: amount of training for the ab- 
“wlute retention data (sce Table 3). Those few 
A tases of significant F ratios for both commis- 
error and response time data do reveal 
ults entirely consistent with those of the 
tking error and omissive error data; for 
ple, better retention performance oc- 
ed after 3 than after 2 wk. of training in 
$ of both commissive errors and response 
(Table 3), and there were more com- 
$ errors after 4 wk. than after 1 wk. of 
retention (Table 3). It is interesting to note 
the commissive error data were the only 
es to show a significant effect of retention 
erval for both the relative loss and the 
“absolute performance analyses (see Tables 
2 and 3). 
This lack of sensitivity of the response 
time and commissive error scores to the in- 
dependent variables may well have been due 
to the delay in feedback which S experienced 
with regard to these two aspects of perform- 
ance: the S obtained such information only 
Siw supplementary feedback at the end of 
tach daily session when Æ provided a debrief- 
ing. The feedback on tracking error was im- 
mediately and continuously available from 
he three tracking displays, and a failure to 
_Tespond appropriately to each event in the 
monitoring task was signaled almost im- 
‘mediately by a red light on the monitoring 
display. 


Discussion 


The above results indicate that each of 
the independent variables influenced both 
a original acquisition and retention tracking 

Performance in a straightforward and pre- 
dictable manner, with two exceptions. Thus, 

ing original acquisition, tracking per- 

ce was superior after 3 wk. com- 

_ Pared to that after 2 wk. of training, and 
Superior tracking performance occurred when 

) accomplished with simultaneous practice on 
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(e) after 3 rather than 2 wk. of original train- 
ing, (4) when accomplished simultaneously 
with the more coherent secondary task, and 
(c) after 1 wk. rather than after 4 wk. of 
retention. Finally, in terms of performance 
loss scores (Table 2), there was less loss fol- 
lowing 3 wk. of original training, and there 
was less tracking performance loss when 8 
experienced the more predictable secondary 
task, but this effect held only for those Ss 
who received only 2 wk. of original training. 
The latter was one of the two exceptions 
noted above: secondary task coherence did 
not appear as a main effect but as an interac- 
tive effect with amount of training in terms 
of retention loss. 

The other exception was that retention in- 
terval did not differentially affect perform- 
ance Joss although it did influence absolute 
retention performance, as noted above. This 
latter exception is actually a bit puzzling, 
since the magnitude of differences between 
means due to the retention interval variable 
was actually larger in the difference score 
analysis than it was in the absolute score 
analysis. One possible explanation for obtain- 
ing nonsignificance due to retention interval 
in the difference score analysis is that the 
within-cell variance was markedly increased 
in this analysis due to the unreliability of in- 
dividual S’s performance. Since the variance 
of a set of difference scores is Sp? = $47 + Sa? 
— 2ranSaSp, a low correlation between the 
training and retention test scores of indi- 
viduals within a group would result in a 
rather large sp, which in turn would result 
in a corresponding increase in the MS error 
term used in the analysis of variance. 

Essentially the same pattern of results 
was found with performance on the secondary 
(monitoring) task. In terms of omissive er- 
rors, the only discrepancy between results 
on the two tasks was that retention interval 
did not influence performance as a main ef- 
fect for either the retention loss data or the 
absolute retention scores. Retention interval 
was a statistically significant effect in terms 
of commissive errors, however. 
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It follows, then, that the major prediction 
was supported, on the whole, by the results 
of this experiment: Both during training and 
at retention test, superior primary task per- 
formance was obtained when it occurred 
simultaneously with the more coherent ver- 
sion of the secondary task, and performance 
on the secondary task itself during retention 
(but not during training) was superior under 
the more coherent than under the less pre- 
dictable version, The latter result is con- 
sistent with those of Trumbo et al. (1965) 
and Noble et al. (1966), while the results 
noted here for the retention of tracking per- 
formance extend those of Trumbo et al. 
(1967, Experiment I) by showing that sec- 
ondary task coherence can affect primary 
task performance in a predictable manner. 

The most consistently influential and the 
most powerful independent variable in this 
experiment was the amount of original train- 
ing: on both tasks during training and at re- 
tention, performance levels were superior for 
groups which experienced 3 rather than 2 wk, 
of original training, and the loss of proficiency 
on both tasks was less for the more highly 
trained groups. Further, the interactions noted 
in Table 2 of Training x Secondary Task 
Coherence indicate that not only can one 
reduce retention losses on both primary and 
secondary tasks with training, but also the 
detrimental effect of low coherence can be 
ameliorated with sufficient original learning. 

Surprisingly, the effect of retention interval 
was less consistent than that of either of the 
other two independent variables. The fact 
that retention interval did not interact sig- 
nificantly with secondary task coherence in- 
dicates that the effect of the latter is uniform 
across at least a 4-wk. interval, Logically, one 
would expect such an interaction to be sig- 
nificant, and visual inspection of Figure 1 
seems to support this expectation: there ap- 
pears to be a greater difference between Groups 
2 and 4 than between Groups 1 and 3 at 
retention test. However, again, this observa- 
tion was not supported by statistical test, and 
obviously from Figure 2 there is no greater 
difference between Groups 6 and 8 than be- 
tween Groups 5 and 7. Had a 6- or an 8-wk. 
retention interval been utilized instead of the 
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4-wk. interval, it is quite possible that the 
expected interaction would have been found, 
Indeed, one may predict that the interaction 
of Amount of Training X Secondary Task 
Coherence X Retention Interval would be sig- 
nificant provided one used either lesser 
amounts of original training or greater reten- 
tion intervals than those used herein. 

While there is danger in speculation, the 
above does provide further (albeit indirect) 
evidence for the very powerful effects of the 
amount of training variable: Overlearning 
can (and apparently did) mask or ameliorate 
the effects of less powerful independent vari- 
ables, especially their interactive effects. 

The authors’ interpretation of the coherence 
variable, as indicated in the introduction, is 
that by manipulating task predictability one 
loads differentially the central information- 
processing channel. The less coherent a sec- 
ondary task input, the more the channel must 
be concerned with the selection and execution 
of responses for the secondary task, and, 
therefore, the less the capacity which is 
available for primary task performance. It 
follows, then, that during training primary 
task performance should vary directly with 
the coherence of the secondary task, and this 
was the result obtained. 

During retention test one encounters not 
only the above effect of secondary task co- 
herence but also less accurate retention of 
the input sequence for the less predictable 
secondary task which compounds the load on 
the central channel. Presumably, this reduces 
further the capacity of the channel to deal 
with the primary task. 

The finding that amount of training can 
ameliorate the effects of secondary task co- 
herence on retention performance (at least in 
terms of retention loss) is consistent with the 
interpretation of a hierarchical organization of 
skill (see Chapter 2 of Fitts & Posner, 1967). 
As (secondary task) skill becomes more au- 
tomated through practice, it places less load 
on the central channel, and thus there should 
be less potential interference to the processing 
of information on another (primary) task 
performed simultaneously. 

Whatever the theoretical interpretation, 
task coherence emerges from these and previ- 
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ous data as an important determinant of 
skilled performance under long-term retention 
conditions. 
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USE OF IMPORTANCE AS A WEIGHTING COMPONENT 
OF JOB SATISFACTION 


PATRICIA SMITH MIKES anp CHARLES L. HULIN I 
University of Illinois, Urbana 


Hypotheses dealing with the rated importance of various aspects of the job were 
tested on data obtained from 660 male and female workers in Montreal, 
Quebec. Importance used multiplicatively to weight satisfaction scores did not 
increase the predictability of turnover during an 11-mo. period over that ob- 
tained by using satisfaction measures alone. No interaction between satisfaction 
and importance predicting turnover was obtained. The general conclusion was 


that importance has little value in a 


attitudes and behavior. 


Strong and consistent relationships between 
employees’ responses to attitude question- 
naires and job behavior have only rarely been 
obtained. It has been implied frequently that 
those aspects of the job which are perceived 
as more important by the worker have a 
greater influence on the behavioral and over- 
all affective responses of the worker to his 
job than do less important aspects, However, 
most job-attitude questionnaires fail to ask 
employees about aspects regarded as more 
or less important. 

Several investigators (Ewen, 1967; Fried- 
lander, 1965; Glennon, Owens, Smith, & Al- 
bright, 1960; Weiss, Dawis, England, & 
Lofquist, 1964a, 1964b) have used importance 
measures in conjunction with satisfaction in 
an effort to demonstrate theoretically and 
empirically more appealing measures. All but 
one concluded that a consideration of im- 
portance is necessary. Ewen, however, con- 
cluded that importance added little. His re- 
sults showed that a summed composite score 
weighted differentially for importance ratings 
was no more strongly related to a measure of 
overall satisfaction than was an unweighted 
composite of the same satisfaction measures. 

The research reported in this paper presents 
what the authors believe to be more definitive 
evidence on the importance of importance. If 
importance is a vital component in the assess- 
ment of job satisfaction then overall job 
These data were gathered in conjunction with 
and reported as part of the senior author’s master’s 
thesis submitted to the University of Illinois. The 
authors would like to thank George Graen and 
Harry Triandis for the comments on an earlier draft 
of the manuscript. 


prediction situation involving job 


satisfaction is dependent on the worker’s at- 
titude toward the various aspects of the job, 
and also on the perceived importance of these 
aspects to him. His general reaction to his 
job would depend more heavily on the feeling 
he holds toward those aspects which he con- 
siders as more important than on his feel- 
ings toward those aspects which are less im- 
portant. A weighted composite measure should 
yield a measure of overall satisfaction with the 
job which is more highly related to behaviors 
such as turnover or absences than an un- 
weighted composite measure. The model pro- 
posed is a multiplicative interaction of im- 
portance and satisfaction with a linear com- 
bination of the products. 

The S’s composite job attitude as measured 
by the product of his importance rating and 
satisfaction score for each job aspect summed 
over all aspects will correlate more highly 
with termination decisions than will either 
the summed importance ratings or an un- 
weighted sum of the satisfaction measures. 

Evidence supporting this hypothesis would 
indicate that the perceived importance of job 
aspects is necessary if respectable predictive 
ability is to be achieved. Negative evidence 
would indicate that Ewen’s (1967) basic con- 
clusions can be generalized to the prediction 
of behavioral responses. (Such negative evi- 
dence could mean that the importance of any 
aspect of the job is reflected in the reported 
satisfaction with that aspect of the job. That 
is, only those job aspects which are important 
to S will be responded to by him with a satis- 
faction score which deviates substantially 
from the neutral point, Unimportant job 
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aspects would receive essentially neutral satis- 
faction scores and would enter the predictive 
equations with very small effective weights.) 

Negative evidence could also be interpreted 
as a reflection of the reliability of a composite 
measure as compared to the reliability of a 
simple sum. Every time one measures a vari- 
able some error is introduced. If satisfaction 
with each job characteristic is measured with 
a reliability of .80 then one would expect 
the sum of the five importance scores to have 
reliability considerably in excess of .80.* How- 
ever, this reliability would be greater than 
the reliability of the same five satisfaction 
scores if they were multiplied by a fallible mea- 
sure of importance before they were summed. 
Even if the measures of importance used 
have high (but less than perfect) reliability, 
the resulting sum of Importance X Satisfac- 
tion would have an overall reliability much 
below the reliability of the simple sum. The 
maximum obtainable correlation between the 
simple sum and any criterion would be greater 
than the maximum obtainable correlation be- 
tween a composite measure and the criterion. 
Tf all the correlations were corrected for at- 
tenuation it is possible that these corrected 
correlations would indicate the theoretical 
superiority of the composite sum of job satis- 
faction. However, the problem cannot be re- 
moved by demonstrating the theoretical supe- 
riority of one measure over another. It is an 
unfortunate fact of life that every time one 
introduces the notion of importance in the 
study of job satisfaction such importance vari- 
ables will have to be measured. Importance 
measures cannot be discarded on the basis of 
the available data. However, the claims that 
importance must be considered in the study 
of job satisfaction are unfounded since such 
claims cannot ignore the problems of relia- 
bility. 

METHOD 

Subjects 

The Ss for this study were the corporate ofice 


personnel of a large international corporation with 
its corporate offices located in Montreal, Quebec. In 


2The use of one of the Kuder-Richardson 
formulas would be inappropriate to estimate the re- 
liability of the sum since the measures of satisfaction 
with the separate job components are not homogene- 
ous parts of a longer test. 
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July 1966 all of the employees were requested to 
take part in a job-satisfaction survey which the 
company was conducting in cooperation with the 
second author. The questionnaires were given to the 
employees by departmental representatives after the 
representatives had met with the investigator and 
had the questionnaire explained to them, The em- 
ployees were asked to complete them at home or, if 
they thought they would have 30 uninterrupted 
minutes, to do them at work. All employees were 
asked to participate. Completed questionnaires were 
placed in envelopes, sealed by the respondents, and 
returned either to the investigator or sent through 
the mails directly to the University of Illinois. 
Anonymity was guaranteed to all respondents and 
they were assured that their individual responses 
would not be made known to the company. Names 
were not requested on the questionnaires but for 
many of the employees their job title and depart- 
ment (which were requested) would be enough to 
identify the respondent. Of the 800 employees who 
worked in the home offices and who were present 
during the period of the study, 670 (83%) returned 
usable questionnaires. 

Variables 

Satisfaction with five aspects of the job (work 
done, pay level, promotional opportunities, super- 
vision, and co-workers) was assessed by means of 
the Job Descriptive Index (JDI) (Smith, 1967) 
which is a cumulative-point adjective checklist. In 
addition, the General Motors Faces Scale (Kunin, 
1955) was used to assess job satisfaction with the 
same five job aspects. In this particular study a 6- 
point version of the original 11-point scale was used. 
The six faces used consisted of two scowling faces, 
one neutral face, and three smiling faces. A measure 
of overall job satisfaction was also obtained. The 
same 6-point version of the General Motors Faces 
Scale was used to assess overall satisfaction. 

Importance was measured by a scale designed to 
give interval scale values of importance. A descrip- 
tion of this scale may be found in Hulin and Smith 
(1967) or Mikes (1967). This scale provided ratings 
of importance of six aspects of the job (interest and 
difficulty of work, pay, promotion, supervision, co- 
workers, and working conditions). Satisfaction with 
work was used in conjunction with both the rated 
importance of interest and difficulty of work and of 
working conditions since the satisfaction measures 
may not adequately discriminate between these as- 
pects of the job. 

Background variables (job level, sex, age, tenure) 
were obtained from the face sheet of the question- 
naire which asked for this and other information. 
Job level was quantified from job listings of the 
company by matching the job title reported on the 
face sheet with the job evaluation scale provided by 
the personnel department of the company involved. 
All of these measures were obtained in July 1966. 

The reasons given by the terminators for quitting 
were recorded during the initial stages of this study. 
The plan was to eliminate those Ss from the sample 
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who were “unavoidable” quits (pregnancy, husband 
leaving town, etc). However, the be- 
tween avoidable and unavoidable quickly became 
meaningless, Within the first month two girls who 
were pregnant reported to the personnel department. 
One terminated. Her questionnaire was placed in the 
unavoidable category. The second girl did not quit 
but only requested a maternity leave and planned to 
return to work. It was imposible to justify the ex- 
clusion of the first girl, The decision was made to 
include all terminators regardless of stated reason 
for quitting. 


Resvutts 


By May 1967, 6 males and 16 females who 
had participated in the study and who could 
be identified had quit their jobs. No correla- 
tional analyses were performed on the male 
sample since the number of cases was too small. 
Analyses for the female sample were per- 
formed on a group of 48 Ss, 16 terminators 
and 32 nonterminators who were matched 
on age, length of service, and job level with 
the terminators. Biserial correlations were 
computed between the sum of JDI scores and 
turnover (scored 1 for termination, 0 for non- 
termination), the sum of the Faces scores with 
job aspects and turnover, the sum of im- 
portance· weighted JDI scores and turnover, 
the sum of importance-weighted Faces scores 
and turnover, the measure of overall job satis- 
faction and turnover, and the sum of im- 
portance scores and turnover. The results 
of this analysis are presented in Table 1. 

The hypothesis was not confirmed. The un- 
weighted sums of the satisfaction scores alone 
proved to be as highly related to termination 
decisions as the sums of the importance- 


TABLE 1 


BISERIAL CORRELATION BETWEEN PREDICTOR 
VARIABLES AND TURNOVER 


j Correlation 
Variable with turnover 

Sum of JDI scores —.46 
Sum of JDI scores weighted by 

importance —.33 
Sum of Faces scores —.41 
Sum of Faces scores weighted by 

importance —.39 
Job- in- general Faces score —.34 
Sum of importance scores —.22 


Note—Females, N = 48, 
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weighted satisfaction scores. The least ac 
curate predictor was the sum of importance 
scores alone. 

One possible explanation for the lower 
predictive validity of the importance-weighted 
satisfaction scores was that the multiplicative 
model used did not distinguish between job 
aspects high in importance but low in satis- 
faction and job aspects low in importance but 
high in satisfaction. The first case would be 
an indicator of future turnover but the latter 
case would not. To test this problem of lack 
of symmetry, the female terminators and their 
matched control Ss were dichotomized at the 
medians of satisfaction with work itself and 
the importance of work itself. The same pro- 
cedure was followed using the 6 males and 
their 12 matched control Ss. In each case the 
medians were computed separately for the 
male and female Ss and therefore different 
median values were used for each of the two 
samples. Satisfaction with work itself was the 
variable selected for this analysis because 
it had the highest correlation with turnover 
of any of the five JDI measures. The 
dichotomized male and female turnovers and 
their matched controls were then combined 
preserving the median splits which had been 
obtained from the separate analyses of the two 
samples. This dichotomization and combining 
of male and female workers yielded a four-cell 
table based on a total N of 66 (22 terminators 
and 44 matched nonterminators). The ratio of 
terminators to total cell numbers in each of 
the four cells was then computed. The inter- 
active model of importance and satisfaction 
would be supported if the 2 x 2 table in- 
dicated a high turnover ratio in the high- 
importance-low-satisfaction cell, a low ratio 
in the high-satisfaction-high-importance cell, 
and intermediate turnover ratios in the other 
two cells. These data are presented in Table 2. 
No statistical analyses were done on these 
data but an inspection indicates the existence 
of main effects due to satisfaction and im- 
portance but no interaction effects are present. 


Discussion 


The multiplicative model presented in this 
paper received little empirical support. The 
sum of satisfaction scores with unit weights 
did as well as the composite sum of satisfac- 


Weicutixc Comronents e Jos Sarisraction 


tion scores when weighted by importance in 
the prediction of turnover, The intuitively ap- 
pealing idea that importance is important in 
the study of job satisfaction received no sup- 
port. The most immediate conclusion which 
could be reached on the basis of these data 
would be that there is no need to use import- 
ance and an interactive model to link satis- 
faction with the different aspects of the job to 
the behavioral responses to the job since the 
simpler model assuming unit weights is more 
parsimonious and is at least as valid. 

There are alternative conclusions which 
could be reached on the basis of these data. 
It is possible that the interactive model of job 
satisfaction is valid but that the measures of 
satisfaction used in this study are invalid. 
Such a conclusion lacks force, however, when 
the validity data for the JDI and Faces mea- 
sures of satisfaction are examined. These data 
(Smith, 1967) indicate high convergent and dis- 
criminant validity for the JDI. The numerous 
studies which have appeared recently using 
the General Motors Faces Scale of overall 
satisfaction have yielded results which would 
be impossible if the Faces Scale lacked validity 
(Ewen, Smith, Hulin, & Locke, 1966; Graen, 
1966; Hulin & Smith, 1967). It is also pos- 
sible that the multiplicative model of job 
satisfaction is basically valid but the measure 
of importance of the various job aspects em- 
ployed lacks validity. The data presented by 
Mikes (1967) indicate high scale reliability 
for this measure. The data pertaining to the 
convergent and discriminant validity are less 
impressive. Nevertheless, these data indicate 
at least an adequate amount of convergent 
validity. 

One psychometric consideration must be 
discussed. What was done in this study was 
to weight a limited number of variables using 
idiosyncratic, individually determined weights. 
Using such idiosyncratic weights, the result- 
ing sum of the weighted variables will have a 
correlation in excess of .90 with the unweighted 
sum of the same variables (Ewen, 1967). This 
extremely high correlation, of course, reduces 
drastically the chances of demonstrating the 
Superiority of one measure or another. 

However, it should be stressed that this 
psychometric consideration is not unique to 
this particular study. Any study which uses 
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TABLE 2 
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— See 
Satie | Importance of work ite 
% — 

with work Bos 
Iten High Low 
High | 5/23 m| 3/10 oo| s33 

Lasis d daa oai aiar da À 
Low 3/10 (0%) | 1/28 (40%) | 14/35 

22/66 


Note.—Numeratory 
nominators indicate 


cells are samber whe quit and de 
cell frequency. 


importance as a weighting measure to weight 
measures of job satisfaction will have the same 
problem associated with it. This means that 
claims for the superiority of weighted com- 
posite measures of job satisfaction are very 
likely unjustified. 

It is also possible that had this study used 
a different behavioral measure as its criterion 
the results would have been more promising 
for a multiplicative model. Termination de- 
cisions were used as the dependent variable 
because this is the one behavioral response 
which has indicated consistent and high re- 
lationships with affective responses to the job 
(Hulin, 1966, 1968; Weitz & Nuckols, 1953). 
The use of productive behavior as a dependent 
variable would have created many problems 
since the previous studies on job satisfaction 
which attempted to link affective responses to 
productive behavior have not indicated any- 
thing like a consistent or significant relation- 
ship (Brayfield & Crockett, 1955; Vroom, 
1964). 

In addition, more promising results might 
have been obtained had other prediction 
models been used to combine the satisfaction 
and importance data. The use of other ad hoc 
models (such as a multiple-regression predic- 
tion equation) would clearly have taken ad- 
vantage of the negative relationships between 
importance ratings and turnover, the negative 
relationships between satisfaction and turn- 
over, and the zero association between satis- 
faction and importance to enhance the predic- 
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tion of turnover. However, with only 48 Ss in 
the present analysis sample such analyses 
could have proved misleading. 

It should also be pointed out that this study 
does not stand alone. The research of Ewen 
(1967) while using a different dependent 
variable (overall job satisfaction) yielded re- 
sults no more promising for an interactive 
model of job satisfaction. It would appear that, 
every time the intuitively appealing use of 
importance in combination with satisfaction 
has been investigated in order to determine 
the validity of such a model, negative results 
have been obtained. 

In summary, it appears that the use of 
importance measures as multiplicative weights 
in a model of job satisfaction does not in fact 
yield the relationships between a composite 
measure of job satisfaction and behavior, 
or between a composite measure of job satis- 
faction and overall satisfaction which are 
stronger, more consistent, or more theoretically 
appealing. The simpler model of job satisfac- 
tion assuming unit weights for the measures 
does as well. 
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EVALUATING MORTGAGE LOAN RISK 


WILLIAM D. BUEL 
Byron Harless, Schaffer, Reid & Associates, Inc., Tampa, Florida 


Mortgage loan application blanks for 


matched samples of delinquent and non- 


delinquent mortgages were statistically analyzed for purposes of identifying 
items capable of discriminating between the 2 samples. 8 bits of information 


(4 application blank items and 4 rat 


ios) differentiated delinquent from non- 


delinquent accounts and, taken in total score form, gave rise to a biserial cor- 


relation of .64 (p< .001). 


Recent efforts by psychologists have demon- 
strated the validity of weighted credit appli- 
cation blanks. McGrath (1960) reports a 
maximum biserial correlation of .58 (several 
weighting methods were compared) between 
a total score based on 24 weighted credit ap- 
plication blank items and a payment-reposses- 
sion criterion relative to automobile purchase, 
while Hassler, Myers, and Seldin (1963) dis- 
covered nine items capable of producing a 
biserial correlation of .71 when taken in total 
score form relative to differentiating “open” 
from “special handling” department store ac- 
counts. Elsewhere, Myers (1963) describes 
point-biserial correlations ranging from .26 to 
.61 for various weighting methods relative to 
differentiating paid from delinquent accounts 
and accepted from rejected applicants, re- 
spectively. 

Literature searches by the present author 
have disclosed no such efforts relative to 
mortgage loans. In fact, the “lore” of the 
mortgage-lending industry suggests that risk 
evaluation would be relatively difficult be- 
cause payment on one’s home looms more im- 
portant than payment on less significant items, 
and therefore elicits a different set of pay- 
ment motives, motives alleged to cause mort- 
gage installment payments where “other” 
periodic remittances might be neglected. 
While such motives may operate, the present 
study exhibits validity comparable to the 
validity of credit scoring systems designed for 
other purposes. 


CRITERION 
So that payment records would be availa- 
ble for parallel samples of delinquent and 
nondelinquent mortgage loan recipients, this 
study employed historic data, data arising 
from the mortgage application blanks of indi- 


viduals to whom mortgage loans had been ex- 
tended for purposes of home construction or 
purchase—judgments about delinquency-non- 
delinquency could be made only on the basis 
of real experience. 

Examination of the organization’s delin- 
quent files suggested an artificially dichoto- 
mous criterion. Delinquency was therefore 
defined as 1 or more month’s failure to pay, 
whereas nondelinquency suggested no pay- 
ment failures. While such stringent definitions 
make differences difficult to discover, they 
were forced upon the investigator by sample- 
size constraints, that is, no usable data could 
be spared. 

METHOD 


The mortgage loan application blanks to be studied 
were drawn from files, by date of loan issuance, for 
the period February 1960 to April 1966. Once 150 
mortgage loan application blanks representing delin- 
quent accounts had been selected, 150 representative 
of nondelinquents were chosen by searching the files 
immediately following each of the delinquent accounts, 
selecting only those blanks which could be matched 
with a delinquent account blank on date (within 30 
days), purpose of application (construction or pur- 
chase), and present address (town). Accordingly, 
these matching variables served as their own control 
and are, across both samples, not necessary of sepa- 
rate statistical treatment. 

After selection of 150 delinquent and 150 nonde- 
linquent accounts, every third pair of files (50 de- 
linquent and 50 nondelinquent) was removed so as 
to provide for a 100-account cross-validation group 
which paralleled the 200-account item-analysis sam- 
ple (100 delinquent and 100 nondelinquent accounts) 
to be used in the analysis of the information on the 
mortgage loan application blank. Also, with reference 
to the delinquent group, the item-analysis and cross- 
validation samples were matched on number of 
months of delinquency. 


1 Appreciation is extended to Warren E. Backert 
and James E. Dirst of the Home Building and Loan 
Association of Aurora, Illinois for permission to pub- 
lish this study. 
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Thirty-one quantifiable or categorizable pieces of 
information were available from each mortgage loan 
application blank, as follows: amount of monthly 
payment, amount of loan, address, possession of tele- 
phone, availability of pledge, length of mortgage in 
months, purpose of mortgage, purchase price, down 
payment, total value of property (home and lot com- 
bined), value of lot, value of building, husband's 
occupation, husband's employer, husband's number 
of months of service, husband's salary per month, 
marital status, wifes employer, wife's months of 
service, wife's salary per month, other income (hus- 
band and wife), total salary per month (husband 
and wife), total income per month (husband and 
wife), age of husband, age of wife, number of chil- 
dren, total dependents other than self (including 
wife whether or not she was employed), value of other 
monthly payments, balance owed on those monthly 
payments, number of other accounts, and a 4-point 

rating from the local credit bureau.? These 

31 bits of information were analyzed, via tetrachoric 
coefficients of correlation,? to determine their power 
of differentiation between the delinquent and non- 
delinquent accounts in the item-analysis sample. 

In addition to the analysis of the items of informa- 
tion described above, 21 of them, either because of 
their quantitative nature or by logical dictate, lent 
themselves to inclusion in ratios with other items. 
Accordingly, a 21X21 paired-comparison matrix 
was prepared to facilitate consideration of all possible 
combinations. While 210 ratios could have been de- 


veloped =, logic suggested the elimination 


of certain of them. For example, the ratio “number 
of children” to “total dependents” is almost meaning- 
Jess since it breaks down into a series of constants 
for mortgagees with like numbers of children and de- 
pendents, Further, the intercorrelation of these two 
values is .9904, indicating overlap and nonindepen- 
dence. Such examples are not infrequent, and judg- 
ment dictated that only 79 ratios be calculated and 
tested for their significance of differentiation. 


RESULTS 


Four items of mortgage loan application 
blank information and four ratios differenti- 


2Some items of information (husband’s occupa- 
tion and husband’s employer are examples) were 
analyzed in such fashion as to reflect qualitative 
rather than quantitative differences, that is, to re- 
flect differences in occupation or employer relative 
to goodness or poorness of payment behavior. Un- 
fortunately, such categories were numerous and fre- 
quencies within them became meaninglessly small 
and useless, 

ĉTetrachoric coefficients of correlation were em- 
ployed in all item and ratio analyses because both 


largest delinquent-nondelinquent response difference) 
against all other intervals of item data. 
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ated delinquent from nondelinquent accounts 
at the Os level of significance or better (N = 
200). Those eight items and their tetrachoric 
correlation coefficients follow: wife employed 
(+.23), number of children (—.27), number 
of other accounts (—.28), credit rating 
(+.45), monthly mortgage payments/age of 
husband (—,30), amount of loan/total income 
(—.24), age of husband/number of depen- 
dents other than self (+.30), and age of hus- 
band/number of other accounts (+.25), 

Each of these items and ratios was assigned 
a set of differential weights ranging from 10 
to 0 within each item, weights which while 
Strictly intended for use with qualitative data 
were chosen for the sake of homogeneity of 
method across both qualitative and quantita- 
tive items (Wherry, 1944). Employing the 
eight weighted items as a scoring key for use 
with 100 mortgage loan application blanks 
held apart from the item analysis and cor- 
relating the total scores so derived with the 
delinquency-nondelinquency dichotomy gave 
rise to a biserial correlation of -64, significant 
at the .001 level of confidence. Figure 1 pre- 
sents the data from which this correlation 
arises, 

Further evidence of validity is connoted by 
the fact that the 50-person nondelinquent 
holdout group achieved a mean score of 
53.24, whereas the 50-person delinquent hold- 
out group achieved a mean score of only 
37.84, a difference of 15.40. Especially in light 
of the total score standard deviation (15.06 
with W = 100), the implications of such a 
mean score difference may be seen. Also, with 
regard to Figure 1, it is interesting to note 
that at several points in the distribution 40% 
more nondelinquents than delinquents equaled 
or exceeded a particular mortgage loan appli- 
cation blank score. The most dramatic evi- 
dence of this sort relates to a score of 50, 
a point at or above which 64% of the non- 
delinquent group scored—only 24% of the 
delinquent group achieved this level. 


Discussion 


While the personnel evaluator is most usu- 
ally concerned with positive discrimination 
(identifying individuals with good perform- 
ance potential), the mortgage loan evaluator 
is most interested in discriminating at the 
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Fic. 1. Results of applying mortgage application blank scoring key to holdout group (NV = 100). 


low end of the scale (identifying the indi- 
Vidual with poor payment potential). Accept- 
ing this inverted emphasis, an examination of 
Figure 1 suggests that rejecting individuals 
achieving mortgage loan application blank 
Scores of 34 or below would result in eliminat- 
ing individuals most of whom would be poor 
risks. Lowering screening standards just one 
Score interval (rejecting individuals scoring 
29 or below) has even more profound effects 
in terms of the percentage of known delin- 
quents who would have been rejected. 

As is obvious, delinquency and nondelin- 
quency do not comprise opposing cells in a 
dichotomy. Rather, they are the qualitative 
extremes of a continuum which can be con- 
Ceived of in quantitative terms. Perhaps the 
truth of the matter is that the items discussed 
here are differentiating with regard to “late- 
hess” of payment (quality), while the cri- 


terion defines lateness in terms of number of 
months of delinquency (quantity). Suffice it 
to say that at the upper extreme of the late- 
ness continuum, lateness evolves into risk for 
the loan-granting institution—risk expressed 
as the cost associated with efforts at collec- 
tion of back payments, indirect expenses re- 
lated to the clerical handling costs of delin- 
quent accounts, legal action, and so forth. 
The typical mortgage-lending institution is 
interested in the mortgagee’s “success” in 
maintaining a periodic payment schedule, and 
the items found valid in this study may be 
viewed as contributors to or inhibitors of such 
success. Considering the signs of the item cor- 
relations cited earlier, payment success would 
seem enhanced by a working wife, a good 
credit rating (a history of payment success), 
and relatively great age of husband in rela- 
tion to numbers of individuals dependent upon 
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him and number of other accounts for which 
he is liable. On the other hand, negative im- 
plications accrue to large numbers of children 
and large numbers of accounts other than 
mortgage, and to relatively great size of 
monthly mortgage payments relative to hus- 
band’s age and relatively great size of mort- 
gage loan in relation to family’s total income. 

As will be noted, age of husband entered 
into the calculation of various scorable ratios, 
even though age of husband did not in itself 
differentiate (this failure is true of all bits of 
information employed in the scorable ratios). 
While there is no question that these ratios 
will correlate with one another (they are 
functions of a common variable), the distri- 
butions arising from them were sufficiently 
different to warrant inclusion in the final key. 
In fact, previous research in the life insurance 
industry (Ferguson, 1967) has furthered the 
concept of “economic maturity” as a predic- 
tor of life insurance sales success, and it may 
be that somewhat the same sort of phenome- 
non is causing age-related ratios to differenti- 
ate mortgage-payment behavior, even though 
age itself is not significant for differentiation. 

As mentioned earlier, it was not possible to 
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delete accounts of 1 month’s delinquency be- 
cause of sample-size considerations. Accord- 
ingly, such were treated the same as truly de- 
linquent accounts (delinquencies in excess of 
1 year were not uncommon). This practice 
has the effect of masking differences, since 1 
month’s delinquency may represent oversight 
rather than true delinquency. As a result, the 
coefficients, differences, and percentages re- 
ported here may be underestimates of the 
practical and statistical significance of the 
scoring key. 
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WORD LEGIBILITY AS A FUNCTION OF LETTER LEGIBILITY, 


WITH WORD SIZE, WORD FAMILIARITY, AND 
RESOLUTION AS PARAMETERS 


R. I. ERDMANN axo A. S. NEAL 
International Business Machines Corporation, Los Gatos, California 


To determine the relationship between letter legibility and word legibility, in- 
dividual characters in context were compared with isolated characters, The 
words included in the stimulus material varied in familiarity and in length. 
2 independent variables were introduced to degrade legibility: character size 
and resolution, Characters of 4 heights were reproduced at each of 20 vertical- 
horizontal resolution combinations, Word legibility was found to increase with 
resolution and with character size. Word length, over the range investigated, 
proved to be unimportant, but word familiarity was an important determinant 
of legibility. Familiar words were found to be about as legible as individual 
letters, with a product-moment correlation coefficient of 88. Unfamiliar words 
were generally less legible than individual letters, with a coefficient of .94. 
If the degree of familiarity of words is known, word legibility can be predicted 


with reasonable accuracy on the basis of letter legibility. 


In efforts to measure legibility, many differ- 
ent criterion measures have been applied. 
Viewing distance and speed of perception have 
been the most widely used threshold mea- 
Sures; speed and accuracy of reading are often 
the basis for suprathreshold measurements. 
Words and nonsense syllables have been com- 
pared in investigations of legibility, with the 
general conclusion that meaning and famili- 
arity contribute to identification. Howes and 
Solomon (1951), studying the influence of 


familiarity on speed of recognition, found the 


a 


~ 


visual duration threshold (measured tachisto- 
scopically) to be a roughly linear function of 
word frequency based on the Thorndike-Lorge 
word count. The product-moment correla- 
tions between log word frequency and visual 
exposure duration ranged from —.76 to —.83. 

In subsequent experiments by Solomon and 
Postman (1952), pronounceable nonsense 
Words were introduced in order to control the 
“frequency of prior usage.” When the rela- 
tionship between tachistoscopic recognition 
threshold and frequency was measured, the 
Tecognition thresholds were found to vary in- 
versely with word familiarity (as it had been 
built into the experiment). Howes (1954) 
interpreted the Solomon-Postman results in 
terms of response emission probability rather 
than exposure duration threshold. In his view, 
Word frequency in a large general word count 
is best interpreted as the average probability 


of emission by the S (which Howes calls the 
“base probability”). Consequently, he argues, 
a short exposure will suffice for any word 
whose base probability is almost as great as 
its probability of occurrence in the set of 
words whose legibility is being measured. 

Since legibility can be defined in so many 
different ways, any new study should clarify 
its particular approach as precisely as possi- 
ble at the outset. Legibility is defined here 
as the average percentage of correctly iden- 
tified letters or words. Legibility is recognized 
as depending on variables in both the stimu- 
lus and the observer. Typically, stimulus ma- 
terial may vary in respect to character size, 
type style, height-width ratio of characters, 
stroke width, color, paper surface, and con- 
trast. Variables which include the user are, 
for example, viewing angle and distance, 
visual exposure duration, illumination, and 
visual acuity. There are additional influences, 
not so readily defined and measured, which 
are implicit in any attempt to measure legi- 
bility. For instance, the S$ is assumed to 
be familiar with the alphabet from which 
the test characters are chosen; otherwise the 
idea of recognition or identification would be 
meaningless. 

Recently, Saltz (1967) has called attention 
to fundamental problems in the way word 
frequency—and consequently, word familiar- 
ity—has been measured. He points out that 
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the Thorndike-Lorge word count disregards 
word meaning in counting frequency, making 
it impossible tọ distinguish the two potential 
influences on legibility. A single spelling 
(bear, for example) may represent two 
meanings, yet its occurrences are counted as 
if it were a single word. Familiarity, fre- 
quency, and meaning overlap in ways which 
blur the significance of the numbers used to 
represent each factor. 

Whatever the proper interpretation may 
be, the cumulative evidence indicates that 
readers recognize frequent or familiar words 


perception, is the criterion measure, this con- 
tinues to hold true. Taylor (1934) found 
meaningful words to be readable at a greater 
distance (approximately 50% greater) than 
nonsense words, whether the printing was 
black or white. Tinker (1963), investigating 
differences between uppercase and lowercase 
letters, found less than a 6% difference in 


distance, but the difference favored meaning- 
east It might be noted 


The redundancy in individual letters con- 
tributes of familiarity to any 
combination of letters, whether it is a mean- 
ingful word or a nonsense syllable. Tinker 
(1963) has shown that the upper halves of 
words are more legibile than the lower halves, 
and the work of Crooks (1959) gives some 
indication of the amount of degradation indi- 
vidual forms (other than words) can suffer 
and still be recognizable. In a sense, the pres- 
ent study attempted to determine how much 
character ion can occur before legi- 
bility is affected; the main objective, however, 
was to quantify the relationship between indi- 
vidual letter legibility and word legibility. 

In planning the experiment reported here, 
it was assumed that lowercase letters are more 
legible in the context of intelligible words 
than separately, character by character. This 
assumption is in keeping with work reported 
by Tinker (1963) which measured legibility 
by such criteria as fixation frequency, per- 
ception time, and perception distance. Infor- 
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mation already available (from Erdmann & 
Neal, 1966) had determined the resolution 
needed to make individual charagters legible 
(as required in parts lists, for example, and 
other arbitrary groupings). That resolution 
was assumed to establish the maximum reso- 
lution requirement for all conditions. There- 
fore, if systematic trends were found between 
word legibility and resolution, the correspond- 
ing relationship between word and letter legi- 
bility could be quantified, and future ques- 
tions about word legibility could be answered 
by research on individual characters. 

On the basis of these considerations, the 
present experiment was designed to determine 
the legibility of words as a function of hori- 
zontal and vertical resolution and character 
size, much as individual character legibility 
was determined by Erdmann and Neal 
(1966). Sinee other legibility studies (includ- 
ing those cited) suggest the need for more 
precise characterization of words, two poten- 
tially important variables—word familiarity 
and word length—were introduced as addi- 
tional experimental parameters. 


METHOD 


The approach taken was to (a) design a test docu- 
ment which incorporated certain differences (in char- | 
acter height, word length, and word familiarity) ; 
(b) reproduce the document, adding systematic dif- 
ferences in vertical and horizontal resolution; (c) 
submit the resulting stimulus material to Ss to meas- „ 
ure legibility; and (d) relate the legibility data thus 
derived to the physical variables under investigation. 
The procedure followed made it possible to start 
with a relatively small number of words, include 
them all in the original document, and then re- 
arrange the material systematically after printing to 
prevent interaction between the content and the ex- 
perimental parameters (word length and familiarity). 

The content includes 144 familiar and 144 un- 
familiar words selected from Thorndike and Lorge 
(1944) to represent a certain range of word lengths 
(see Figure 1). The “familiar” are among the 10@ 
words most common in English; the “unfamiliar” 
appear less often than once in a million words, and 
more often than once in 4 million. The row of indi- 
vidual characters below each set of words is made 
up of 17 letters in random order, with a, e, and o 
tepeated. The whole document is in lowercase Dual 
Gothic type, selected for its relatively uniform stroke 
width and its simplicity. 

Where the criterion measure is legibility, defined 
as the percentage of correctly identified letters or 
words, degradation variables are often introduced to 
control some aspect of the experimental situation— 
viewing time or distance, for example, or resolution, 
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remain complete 
chance although 

eabnejos 


oye however 
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ftatalist 
bisque armature 
fleste dlsinter 

um Un tert ee 

51% 

polygon 

oce 


plop cavor? 
lure 
ahoy 
9 1 
geo 
pnt 
hob 
dao 
re 


fugue 
ariel 
noway 


Fic. 1. Test document for word legibility study (one of four sessions) 


size, or contrast in the stimulus material. In the work 
reported he character size and resolution were 
varied, but high contrast was maintained through- 
out the printing process, In Sections B, C, and D of 
the test document, character heights were 40, 60, 
and 80 mils, respectively. In Section A (shown in 
Figure 1), characters were 100 mils in height, corre- 
sponding approximately to the 12-point type familiar 
in typescript and elsewhere. (The point system is 
Standard in typography, but does not specify charac- 
ter size exactly. Character height is the better basis 
for direct comparisons of the effects of character 
size.) 

The test document, mounted on the drum of a 
digital facsimile device for scanning and printing, 
was reproduced at 20 different combinations of hori- 
zontal and vertical resolution. The device scanned 
across the document in narrow bands, sampling 


` periodically to determine if the document at suc- 
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cessive points was black or white, The smaller the 
size of the scanning aperture and the more frequent 
the sampling, the greater the resolution. The resolu- 
tion obtained can also be described, as it is more 
commonly, by the number of samples or elements 
Per inch on the output copy. One can consider the 
scanned document to be covered by a grid, with 
each square in the grid representing a resolution 
clement and the dimensions of the grid being the 
shape of the scanning aperture. Figure 2a shows how 
Such a grid might reproduce one character on a docu- 
ment page. The character outlined represents the 


printed copy being scanned, and the shaded blocks 


show how the character would be “reported” by 
five successive scan lines, Figure 2b is an actual 
Photograph of a character before and after digital 
facsimile reproduction. à 
Five different horizontal resolutions and four dif- 
ferent vertical resolutions were used in this study. 
The size of the scanned element (square of the grid) 
for each of the 20 combinations is shown in Table 
1. From the 20 copies reproduced, four test booklets 
were assembled so that each group of words (Sec- 
tion A, B, C, or D on the master document) was 


* 


presented on five consecutive pages at the lowest 
vertical resolution in combination with tach of the 
five (increasing) horizontal resolutions The next 
five pages represented the next larger vertical resolu- 
tion at each horizontal resolution (for the same 
group of words) and so on. Sections A, B, C. and D 
were presented in that order in the first test book- 


Fro. 2a. Digital facsimile “grid” showing black 
white decision for each clement area. 


Fic. 2b. Printed character before and after digital 
facsimile reproduction at a resolution of 150 X 100 
elements per inch (20 X Magnification). 
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TABLE 1 


RESOLUTION COMBINATIONS (IN Square Mrs) 
Comparep N Worp Leoremry Stupy 
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TABLE 2 


LEGIBILITY as AFFECTED By WORD LENGTH AND 
FAMILIARITY FOR Two CHARACTER HEIGHTS 


Horizontal resolution 


Vertical (elements/inch) 
resolution 
(elements/inch) 
75 100 | 125 | 150 | 200 
100 133 100 80 67 50 
125 107 80 64 53 40 
150 89 67 53 44 33 
200 67 50 | 40 33 23 


let, in another sequence in the second booklet, etc. 
(Four of the 24 possible sequences of word groups 
were selected at random.) 

The procedure permitted subsequent evaluation 
of the effects of 20 element areas and independent 
evaluation of the effects of horizontal and vertical 
resolution. Since the page order within each booklet 
was in the order of increasing resolution, the S 
would be expected to identify a larger percentage 
of words on each successive page. 

The same words were read at all five horizontal 
resolutions while vertical resolution was held con- 
stant to eliminate confounding with word form and 
familiarity. This required modifying the method of 
limits (by using only an increasing series of hori- 
zontal resolutions) to prevent memorization of the 
stimulus materjal from an earlier page. This design 
was also practical in view of the limited time and 
resources available to produce the stimulus materials 
and conduct the experiment. Pilot experimentation 
prior to the main experiment indicated negligible 
practice effects when only the ascending series was 
used, 

The Ss for the experiment were 72 college students 
with 20/20 vision, natural or corrected, who volun- 
teered for the task and received a fixed compensation. 
Each S was tested separately in a well-lighted booth. 
He was told that the object of the study was to 
determine how printing resolution was related to the 
legibility of letters and words. He was then asked to 
read aloud as many of the words and letters as pos- 
sible in one of the four test booklets, page by page 
from front to back. If he could not pronounce a 
word, he was allowed to spell it, but the word was 
not considered identified unless every letter was 
named correctly (i e., each word was considered as 
a unit). Proceeding at his own pace, each S spent 
approximately 1 hr. at the task. The “answers” re- 
corded on coded sheets by the E were subsequently 
transferred to punched cards, and legibility results 
were calculated by means of programs written for the 
IBM 1620 computer, 


RESULTS 


From the accumulated data, the computer 
calculated the percentage of correctly iden- 


Average percent legibility 


ar 40-mil characters 60-mil characters 
| 
Familiar | Unfamiliar | Familiar | Unfamiliar 

3letters | 43.66 18.01 
4 letters 84.31 70.07 
Sletters | 40.83 16.80 

ó letters 89.81 64.21 

7 letters | 43.03 25.62 

8 letters 91.46 77.48 


tified words for each character size as a func- 
tion of word length, familiarity, and vertical 
and horizontal resolution. Most of the 20 
resolution combinations produced 100% legi- 
bility for words with 100-mil and 80-mil char- 
acters. The ranges were 93-100% and 70- 
100%, respectively. For 60-mil words the 
lower limit dropped sharply, the range being 
30-100% ; for 40-mil words, legibility ranged 
0-100%. 

This preliminary process of elimination 
made it apparent that the more detailed analy- 
sis should concentrate on the shorter char- 
acters, since the range of legibility for the 
taller was so limited. An analysis of variance 
was therefore performed for the 40-mil and 
60-mil words to determine the effects (if any) 
of word length and familiarity. The results, 
summarized in Table 2, show statistically sig- 
nificant differences between familiar and un- 
familiar words. For word length, on the other 
hand, the F ratio indicated negligible effects 
(see Table 3). The data were therefore com- 
bined across all word lengths for the remain- 
der of the analysis. 

The three remaining categories—letters, 
familiar words, and unfamiliar words—were 
compared as shown in Figure 3, with vertical 
and horizontal resolution combined to make 
element area the unit of resolution. (Subse- 
quent interpretations of the data took into 
account the possible effect of the proportions 
of that unit, and considered vertical and 
horizontal resolution separately.) As in the 
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TABLE 3 


Anatysis Or Variance Dur to Worp Lexcta 
AND FAMILIARITY FOR Two CHARACTER 


HEIGHTS 
Item df MS F ratio 

40-mil characters 
Familiarity (F) 1 15,000.59 14.46* 
Word length (L) 2 310.85 30 
FXL 1 190.61 18 
Error 114 1,037.56 =x 

60-mil characters 

| 

Familiarity (F) 1 9,655.20 15.54 
Word length (L) 2 72³.93 1.17 
FXL 2 440.99 a71 
Error 114 621.38 — 


Note. Critical ratio for Fisioo = 6.90; for Fa ue it is 4.82. 
*p <01. 


Erdmann and Neal (1966) study of individual 
characters, legibility systematically increased 
with character height and with resolution (i.e, 
with smaller element areas). As to the relative 
legibility of words and letters, and the effect 
of familiarity, the results showed familiar 
words and individual letters to be consistently 


LEGIBILITY (PERCENT) 


70 85 100 i 
ELEMENT AREA (MILS?) 


—.—.— FAMILIAR WORDS 
UNFAMILIAR WORDS 
—— LETTERS 


Fic. 3. Legibility of familiar and unfamiliar words, 
and of letters, at four character heights. 
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HORIZONTAL RESOLUTION (ELEMENTS/INCH) 


200 


Fic. 4. Legibility of 60-mil characters, showing sepa- 
rate effects of horizontal and vertical resolution. 


more legible than unfamiliar words, at all 
character heights. 

The independent effects of vertical and 
horizontal resolution were plotted in a repre- 
sentative graph for one character height (60 
mils). As that graph (Figure 4) shows, the 
legibility of both familiar and unfamiliar 
words increased steadily with improved resolu- 
tion in either direction. Again, familiar words 
were consistently the more legible. Another 
approach (see Figure 5) assumed a goal of 


150 F —.—.— FAMILIAR WORDS 
UNFAMILIAR WORDS 
= LETTERS 


VERTICAL RESOLUTION (ELEMENTS/INCH) 


INCREASING RESOLUTION 


200 150 125 100 
HORIZONTAL RESOLUTION (ELEMENTS/INCH) 


Fic. 5. Horizontal and vertical resolution required 
for 90% legibility with 60-mil characters. 
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90% legibility with 60-mil characters, and 
plotted the corresponding resolutions, Famil- 
iar words reached that legibility level at the 
lowest resolution; individual letters required 
greater resolution, and unfamiliar words, still 
greater, 

The results of the experiment can be seen 
most graphically in Figure 6, where all the 
data points are plotted for legibility levels 
below 100%. The open circles represent av- 
erage legibility for familiar words (by word 
length) and black dots represent unfamiliar 
words. At points below the dashed line, word 
legibility is higher than letter legibility. The 
solid curves which enclose the data points are 
labeled to summarize the data trends. Word 
legibility varied considerably in some cases in 
relation to a given level of letter legibility. 
The extreme case—a range of more than 50% 
—was observed at about 40% letter legibility. 
Word legibility varies less, of course, near the 
O and 100% levels of letter legibility. The 
product-moment correlations for the relation- 
ship between letter and word legibility are: 
-98 for familiar words, .94 for unfamiliar 
words, and .93 for words regardless of degree 
of familiarity. 

These correlations depend in part upon the 
particular experimental conditions. If data 
had been collected only at points where the 
legibility was 0 or 100%, the correlations 
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MINIMUM LEGIBILITY 
(UNFAMILIAR WORDS) 


0 25 5 - 7 100 
LEGIBILITY OF INDIVIDUAL CHARACTERS (PERCENT) 


Fic. 6. Word legibility versus character legibility, 
as affected by word familiarity. (Open dots represent 
average legibility for familiar words; closed dots, for 
unfamiliar words.) 
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would, of course, be unity. In practice, how- 
ever, the data points for the correlations show 
variability at each legibility level. Thus one 
would expect that for any reasonable distri- 
bution of conditions the trends indicated in 
this study would be verified; namely, the 
legibility of individual letters is more highly 
correlated with the legibility of familiar words 
than with that of unfamiliar words, 


Discussion AND CONCLUSIONS 

Up to a point, the word legibility experi- 
ment serves to corroborate the findings of the 
related character legibility study (Erdmann & 
Neal, 1966). The results are comparable with 
respect to the effect of vertical and horizontal 
resolution on legibility for selected character 
heights. As expected, the taller the characters 
in use, the more legible the word at any given 
resolution. Correspondingly, for a given char- 
acter height, word legibility increased with 
increasing resolution. 

The specific purpose of the present study 
was served by the results which relate word 
legibility to character legibility, in the inter- 
est of predicting the former from the latter 
(which may be more readily measured). Pre- 
vious work (Tinker, 1932, 1963) has led to 
the conclusion that words in general are more 
legible than individual lowercase letters. The 
present experiment showed this to be true 
only for familiar words at high levels of legi- 
bility. If a word was familiar to him, the § 
needed only a few legible letters to enable 
him to identify it. If it was unfamiliar, more 
of the letters needed to be legible to make 
the word legible. At the extreme, when S 
spelled out unfamiliar words, word legibility 
was found to be lower on the average than 
for individual letters, since no partial credit 
was allowed for words, 

The results are of course dependent upon 
the definition of word legibility which con- 
siders a word as a unit. A word is either 
recognized or not, and no credit is given un- 
less all the characters in the word are iden- 
tified. This definition follows the convention 
set by Tinker (1932) and employed in cur- 
rent legibility research (Kinney & Showman, 
1967; Shurtleff, 1967). 

If characters were grouped into words but 
credit was given for individual characters, one 


iki expect familiar words to score some- 

higher than these groups of random 
racters. Longer familiar words might score 
igher than shorter words up to some limit, 
yond which the identification of a few let- 
ers no longer insured identification of the 
ord (even though length was not important 
en words were scored as a unit), 


characters), unfamiliar words should be 
approximately equal to random letters in 
gibility. Very unfamiliar words would proba- 
require the identification of every char- 
r (except where knowledge of the common 
equences of letters in English words offered 
an advantage). The important thing to note 
is that even where the measurement process 
fers, the legibility of words remains a func- 
tion of word familiarity. 
In conclusion, word familiarity was found 
o have considerable influence on unit word 
bility, whereas word length had little effect. 
familiar material, the average legibility 


al letters. However, for unfamiliar ma- 

(one might think of words in a foreign 
guage, or technical information outside 
s own field), word legibility can be ex- 
pected to be less than the average legibility 

individual letters. With correlation coeffi- 
nts ranging from .98 for familiar words to 
for all words, it is apparent that knowledge 
letter legibility is helpful in predicting 
legibility, and knowledge of word famili- 
y further increases the accuracy of such 
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EFFECT OF LANE-CLOSURE SIGNALS UPON DRIVER 
DECISION MAKING AND TRAFFIC FLOW * 


KENNETH PERCHONOK ? anp PAUL M. HURST 


Institute For Research, State College, Pennsylvania 


A decision-theoretic model was applied to driver behavior on an urban express- 
way. The model permits inferential measurement of “responsiveness versus 
confusion” and “risk-taking predisposition.” These variables, as well as 2 direct 
measures (“risk taking” and “hazard”), were measured in a field study of 
forced merging from a blocked lane. Resultant indexes were compared under 2 


experimentally manipulated methods of lane closure: 


signal closure (SC) and 


conventional closure (CC), The SC method, which provides earlier warning of 
the closure, was superior to CC in terms of lower hazard and greater responsive- 
ness versus confusion. There were also improvements in several traffic-flow 


characteristics. 


The purpose of this study was to opera- 
tionally define variables immediately involved 
in, and resulting from, the decision-making 
processes of drivers in a merging situation, 
to measure the changes in these variables 
under differential traffic-control systems, and 
hence to use these measured changes to evalu- 
ate those systems. The investigation proceeded 
in the context of stimulus-dependent response 
elicitation mediated by a decision-making 
process. 

Briefly, the model for decision making 
states that a gap of size g will be accepted 
if it is larger than a driver’s threshold, G, the 
smallest acceptable gap. It follows that P, 
(A), the probability of accepting a gap of size 
g, is equal to the probability, P(g > G), that 
g is above threshold, and that, as g grows 
larger, P, (4)= P(g > G) increases from 
O to 1. It has been argued by Hurst (1964) 
that as drivers become better able to per- 
ceive and interpret environmental information 
their evaluations of incurring risk in gap ac- 
ceptance/rejection become better, and their 
behavior becomes more consistent with re- 
spect to the immediate environment, and that 
this consistency can be measured by n, the 


1This investigation was supported by Public 
Health Service Research Grant No. AC-66, from the 
Division of Accident Prevention. The authors wish 
to thank Edward Gervais and Frank DeRose, Jr., 
and their associates on the John C. Lodge Freeway 
Traffic Surveillance and Control Research Project, 
without whose cooperation this study could not 
have been done. 

2Now at Cornell 
Buffalo, New York. 
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correlation ratio between gap “size” and ac- 
ceptance/rejection. Perchonok (1964) showed 
that this increased consistency is essentially 
equivalent to an increase in the slope of P,(A) 
for 0 < P,(A)<1. Conversely, as drivers 
become less responsive to the relevant environ- 
ment (they behave in a more confused man- 
ner), the response curve flattens out and the 
value of 7 is diminished. Thus , as an index 
of “responsiveness versus confusion,“ mea- 
sures the ability of drivers to cope with their 
respective environments. 

In addition to changes in the degree of 
responsiveness, driver behavior also reflects 
the amount of risk that drivers are willing 
to tolerate. That is, if G, the gap-acceptance 
threshold, is lowered, P(g > G) increases and 
P,(A), the probability of acceptance, in- 
creases. Thus, the gap-acceptance threshold 
reflects “risk-taking predisposition.” The ef- 
fect of change in driver responsiveness and 
risk-taking predisposition is illustrated for a 
hypothesized example in Figure 1, where, in 
comparison to Curve A, B shows decreased 
responsiveness and C shows a lowered thresh- 
old or increased risk-taking predisposition. 

It can be seen that either type of change in 
the response curve is likely to reflect the 
probability of accepting a small gap, which 
is taken as our definition of “risk taking.” 
That is, not only would a decreased threshold 
yield an increased probability of accepting a 
small gap, but decreased responsiveness would 
also tend toward that outcome. 

Finally, a fourth measure was employed as 
an index of resultant “hazard.” The measure 
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is given simply by the number of small gaps 
accepted; the greater the number of small 
gaps accepted, the greater the net hazard to 
drivers of merging vehicles and to those in the 
adjacent lane. It should be noted that al- 
though hazard is a function of risk taking, it 
is also a function of the distribution of avail- 
able gap sizes—if very few small gaps are 
presented, hazard must be low even if the 
probability of accepting a small gap, when 
presented, is high. 

In order to pursue this work most directly, 
it was decided to establish the “most relevant 
stimulus” with respect to gap-acceptance be- 
havior. Such a measure of gap size was 
selected on empirical grounds from a list of 
27 algebraic combinations of variables de- 
scribing the kinematics of the merging vehicle 
and the vehicles forming the gap (Hurst, 
Perchonok, & Seguin, 1968). 


METHOD 
Data Collection 


Data were recorded on 35-mm. film using an 
elevated camera overlooking a three-lane section of 
the John C. Lodge Freeway in Detroit, Michigan. 
The observation area was approximately 350 ft. 
long; the lane closure was established by a mainte- 
nance vehicle located in the medial lane at the down- 
stream end of that area, Data collection started at 
2:15 P.. on two successive afternoons, one for each 
type of closure system. 

Under one system, conventional closure (CC), the 
only warning given to drivers was supplied by the 
truck and a wooden panel, supporting flashing lights, 
attached to the rear of the truck. Under signal 
closure (SC) additional warning was provided by an 
overhead sign approximately 1,200 ft. upstream of 
the truck, This sign displayed electrically lighted sig- 
nals consisting of a red “X” for the closed Jane and 
a green arrow for each of the open lanes. In addi- 
tion, a traffic engineer operated a suggested speed 
indicator, at the same location, following the same 
procedures used in any normally occurring lane 
closure. ; 

The data reduction involved the transformation 
of pictorial information for each ¥, sec. into quan- 
titative form via the Pictorial Data ‘Transducer sys- 
tem developed by the Institute For Research, State 
College, Pennsylvania. This system involves a semi- 
automated film-scanning device, from which vehicu- 
lar positional data on IBM cards are obtained, and 
a battery of computer programs to organize this out- 
put into an analyzable format. A more complete de- 
scription of the system is given by Seguin and 
Perchonok (1968); the details of the transducer it- 
self are discussed by Hicks (1964). 


GAP ACCEPTANCE 


o 


PROBABILITY OF 


GAP SIZE 


Fic. 1. Three hypothetical response curves illus- 
trating differential responsiveness and risk-taking 
predisposition. 


Definitions and Formulas 


“Gap size” was measured by a decision point 
which was defined as that time, to the nearest 1⁄4 
sec., at which the longitudinal locations of the rear 
of the lead vehicle defining the gap and the front of 
the subject vehicle in the medial (closed) lane were 
coincident. For the purposes of this study only 
those gap occurrences were included for which the 
lead vehicle had a higher velocity than the subject 
vehicle. Gap size as determined by Hurst et al. 
(1968) was measured as: 


gap size = [Hr/V r] —Tr-s 


where He =the distance (ft.) between the lead and 
following vehicles, minus the length of the subject 
vehicle; Vr = speed (ft/sec) of the following ve- 
hicle; and Tris the minimum time (sec.) re- 
quired by the following vehicle to match the velocity 
of the subject vehicle assuming maximum decelera- 
tion with a coefficient of friction of .6, and a 
response time of .75 sec. 

“Responsiveness versus confusion” was measured 
by u, the correlation ratio, which is the square root 
of the proportion of acceptance/rejection behavior 
attributable to gap size. A higher value of n implies 
greater responsiveness and, therefore, less confusion. _ 

“Risk-taking predisposition” was measured by G, 
the mean value of the gap-acceptance threshold as 
computed by transforming the acceptance curve to a 
frequency distribution of thresholds and computing 
the mean as suggested by Spearman (Guilford, 1954). 

A “small gap” was defined as any gap whose size 
was less than the G computed from the pooled data. 
It was, then, any gap which the average driver 
would consider unacceptable. 

“Risk taking” was defined as the number of small 
gaps accepted divided by the number of small gaps 
presented. It was, thus, an estimate of the probabil- 
ity that a small gap would be accepted. 

“Hazard” was defined as the number of small gaps 
accepted. 


RESULTS 


Volumes under the two closure systems were 
essentially equivalent: under signal closure 
(SC) the volume was 67.1 vehicles/min, while 
under conventional closure (CC) the volume 
was 67.7 vehicles/min. However, relative 
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TABLE 1 


Lane Votumes ror Sicnar Ciosure (SC) N 
CONVENTIONAL Closuxk (CC) 


KENNETH PERCHONOK AND PauL M. Hurst 


TABLE 2 


Taree Trarric-FLlow CHARACTERISTICS FOR SIGNAL 
Crosure (SC) ann CONVENTIONAL Closure (CC) 


Proportion of 
volume by lane 


| Volumes in 


vehicles/min 
Lane 


| SC | GE sc cc 
Shoulder 31.8 30.5 47 45 
Center 30.6 25.8 46 38 
Medial 4.7 11.4 07 17 
Total 67.1 67.7 1.00 1.00 


lane volumes did change as a function of 
closure (Table 1). Table 1 further explicates 
the change in relative lane volumes in terms 
of the probabilities of entering the observation 
area in each of the three lanes. 

Chi-square tests showed the probability 
that a vehicle entering the observation area 
did so in the medial lane was significantly 
(p < .002) lower under SC and that the 
probability of entering in the shoulder lane 
was not significantly changed. Of the vehicles 
not entering the medial lane, the likelihood of 
entering the center, as opposed to shoulder, 
lane increased under SC but not significantly 
(p = .17). The relative lane volumes leaving 
the observation area were quite similar under 
the two closure systems. 

Other changes in traffic characteristics that 
were noted are shown in Table 2. The prob- 
abilities of these results occurring if chance 
alone had been in operation were sufficiently 
small to allow rejection of the hypotheses of 
“no difference.” 

The curves in Figure 2 are the empirical 
response curves under the two systems of 
closure. The two vertical lines represent the 
respective Cs. 
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Fic. 2. Proportion of gap acceptances as a function 
of gap size for two types of closure, 


Characteristic | sc | cc | $ 
Median speed of medial lane vehicles! | 
at decision point (ft/sec) 18.7 | 14.0 | .058* 
Median number of vehicles in 
observation area 16.0 | 17.0 | .042= 
Median waiting time from area en- 
trance to gap acceptance (sec.) 6.0 | 9.0 | .012 


a Significance level using one-tailed Mann-Whitney U test 
that observations under one system were stochastically higher 
than those under the other system (Siegel, 1956). 

b Two-tailed test. 


The values of the decision-making correlates 
for SC and CC along with significance levels 
for respective changes are given in Table 3. 
All differences, except that for risk taking, 
had small probabilities of chance occurrence 
and were therefore accepted as “real” dif- 
ferences. 


Discussion AND CONCLUSIONS 


The similarity of overall volumes seems to 
indicate that observed differences are reason- 
ably attributable to the type of closure, as 
opposed to the nature of the traffic. The 
relative effect of SC upon lane volumes was 
as expected: the early warning of lane closure 
was successful in inducing drivers to leave the 
medial lane causing a concomitant increase of 
volume in the center lane. Thus, the signal 
system was successful with respect to its 
primary function. 

Additional benefits included a decrease in 
the slowing of subject vehicles and a shorter 
waiting time before leaving the closed lane. 
The significant reduction of vehicle density in 
the observation area implies that vehicle 
velocities tended to be higher, and that po- 
tential capacity was increased. Thus, the ef- 
fectiveness of the SC apparently extended to 
a general improvement in traffic flow. 

The decreased confusion during SC is 
graphically exhibited in Figure 2 which ex- 
hibits a tendency toward more appropriate 
behavior than under CC; namely, the likeli- 
hood of accepting very small gaps and of re- 
jecting large gaps is lower under SC than CC. 
It should be noted that this phenomenon 
obtains even in the face of a changed thresh- 
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TABLE 3 


DECISION-MAKING CORRELATES FOR SIGNAL Crosure (SC) anp CoxvenTionaL CLosure (CC) 


Variable sc cc Test | ? 
Responsiveness versus confusion (n) .890 714 Wilcoxon* 063 
Risk-taking predisposition @) — 1.204 —.428 Wilcoxon” | 024 
Risk taking — accepted a 43 30 Fisher? . 

no. small gaps 
Hazard (no. small gaps accepted) 3 9 Chi-square* | 050 


a The Wilcoxon matched-pairs signed-rank test was used to detect stochastic differences, The directions of change in re- 


sponsiveness versus confusion” and in“ hazard" were predicted; thus one-tailed hypotheses were tested. 


> Two-tailed. 


old. It can be inferred that although drivers 
under SC exhibited a greater risk-taking pre- 
disposition—a willingness to accept smaller 
gaps—their lower level of “confusion” per- 
mitted them to better behave in correspond- 
ence with the environment, with the result 
that there were no significant differences in 
actual risk taking by individual drivers. 
Thus, the effect of SC in reducing “confusion” 
paid off. The early alerting to lane closure ap- 
parently allowed drivers to better assimilate 
information about the environment and thus 
to respond more appropriately to it. 

The increased risk-taking predisposition 
under SC was unexpected. An explanation, al- 
though ad hoc, is available. It had been ob- 
served via the television surveillance system 
in Detroit that some drivers ignore the lane- 
closure signal and remain in the closed lane, 
taking advantage of its reduced volume, Since 
drivers receive an earlier warning under Sc, 
it seems appropriate to assume that of the 
drivers in the observation area a greater pro- 
portion were there by choice under SC as 
compared to CC. Thus, instead of interpreting 
the increased predisposition toward risk tak- 
ing as a product of the situation in which the 
driver finds himself, it seems likely that the 
early warning acts as a selection device en- 
couraging the less aggressive driver to leave 
the closed lane, Thus, the more aggressive 
driver remained to take advantage of the 
reduced medial lane volume at the possible 
cost of encountering higher risk when, as he 
ultimately must, he changes lanes. 


Finally, it can be seen that measured “haz- 
ard” was greater under CC than SC. The 
differential comparison between “risk taking” 
and “hazard” is attributable to the reduced 
medial lane volume under SC. Here, again, 
the success of the signal system is obvious. 

It may be concluded that the SC system 
provided a significant safety improvement 
over the CC system in terms of reduced ob- 
jective “hazard” in gap acceptance. The 
means whereby this was achieved were a re- 
duction in the proportion of vehicles entering 
the observation area in the closed lane and 
(by inference) a decrease in “confusion” by 
drivers of merging vehicles. 
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LEADERSHIP PERFORMANCE OF NURSING SUPERVISORS 


AT TWO ORGANIZATIONAL LEVELS * 


STANLEY M. NEALEY axo MILTON R BLOOD? 
University of Minois 


The relations of leader style and leader behavior to work-group performance 
and subordinate job satisfaction have been investigated for 22 Ist-level and 
8 2nd-level nursing supervisors in a Veterans Administration Hospital of 1680 
beds. Task-oriented (low least preferred co - worker LC) leaders received 
higher performance ratings at the Ist level of supervision, while relationship- 
oriented (high LPC) leaders performed better at the 2nd level of supervision. 
Subordinate'’s job satisfaction was positively related to leader Consideration at 
both levels of supervision, However, Initiating Structure leader behavior con- 
tributed to high subordinate job satisfaction at the Ist level of supervision but 
low subordinate job satisfaction at the 2nd level. These differences in effective 
supervisory patterns point to differences in situational leadership demands at the 


2 supervisory levels. 


The present study was designed to explore 
in a field setting the relations of leadership 
style and leadership behavior on the one hand 
to measures of work-group performance and 
work-group satisfaction on the other, and to 
compare these relations across two organiza- 
tional levels. In other words the aim is to 
discover if and how leadership demands dif- 
fer by organizational level. 

There are several reasons for hypothesizing 
that the first-level supervisor and his boss, the 
second-level supervisor, will face different 
leadership demands. For one thing, leader- 
ship theory and leadership research have for 
several decades been moving away from the 
idea of universal leadership traits and the 
“one best way” to lead. In place of these 
earlier oversimplifications, leadership effective- 
ness is now generally conceded to involve an 
interaction between the leader’s characteris- 
tics, his behavior, the nature of the followers, 
and the characteristics of the particular lead- 
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ership situation, including the nature of the 
task and the organizational setting. Terman 
(1904) initially suggested this approach, and 
studies by Carter, Haythorn, Shriver, and 
Lanzetta (1951), Hemphill (1949), Stogdill 
(1948), Flanagan (1949), and many others 
(see Gibb, 1954) have led to its general ac- 
ceptance, The interactional conception of 
leadership is the basis for Fiedler’s (1964, 
1967) model of leadership effectiveness and 
figured prominently in reviews of research on 
supervision and productivity by Dubin 
(1965) and Sales (1966). The point is that 
a number of situational variables, for in- 
stance, follower characteristics, leader power, 
and group task, may vary markedly by 
organizational level. Leader behavior, in order 
to be effective, may need to vary accordingly. 

A number of writers in industrial adminis- 
tration, among them Barnard (1950), Niles 
(1949), and Pfiffner and Sherwood (1960), 
have hypothesized that management func- 
tions do vary from level to level within organi- 
zations. Georgopoulos and Mann (1962) 
found that technical skills become less im- 
portant as managers move upward in organi- 
zations. Argyris (1964) and Katz and Kahn 
(1966) have expressed similar views. 

A partially contrasting perspective on the 
above question of differential demands by 
management level is provided by several 
studies in which managers at different organi- 
zational levels have expressed their opinions 
about what qualities lead to success (see 
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Fleishman, 1953; Pellegrin & Coates, 1957; 
Porter & Henry, 1964). These studies all 
found some differences by level, but these dit- 


| ferences were not striking. On the whole, the 


managers surveyed did not feel that effective 
patterns of leadership attributes differ greatly 
by organizational level. Related studies by 
Rosen (1961) and by Triandis (195%, 
1959b) reinforce this conclusion. 

The opinions of industrial administration 
scholars and those of managers thus seem in 
only partial agreement that organizational 
level is an important variable in determining 
which pattern of leadership behavior will be 
most effective. Whatever may be the eventual 
empirical resolution of these differences, if we 
are willing to entertain the notion that the 
leadership demands on top managers may be 
different in kind as well as degree from the 
leadership demands at lower levels, then we 
must face the possibility that experience at 
lower levels may have minimal value, or may 
even be disadvantageous, to effective perform- 
ance at higher levels in the organization. 


METHOD 
Research Site and Subjects 


This study was conducted in a Veterans Admin- 
istration hospital of 1,680 beds. The Ss were nursing 
personnel at four levels in the nursing service as 
follows: 248 nursing assistants, 43 staff nurses, 22 
head nurses, and 8 unit supervising nurses. All Ss 
except nursing assistants were RNs. 

The aspect of the study which is of concern here 
is a comparison of certain leadership variables across 
two levels of the nursing staff. For this purpose head 
nurses will be considered as first-level supervisors 
and will be referred to as such from here on. Each 
head nurse supervised directly three to eight nursing 
assistants as well as one or two staff nurses. The 
hospital was organized into six units, each with 
roughly equivalent functions. The nursing function 
in each unit was under the supervision of a unit 
supervising nurse who reported directly to the hos- 
pital’s chief of nursing. An evening supervising nurse 
and a night supervising nurse had responsibility for 
the nursing function for the whole hospital during 
their respective shifts. Fon pe of 1 these 
two have been lumped e six uni 
and these eight nurses will hereafter be referred to 
as second-level supervisors. 


Predictor Variables 
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isfaction of the immediate subordinates of each 
supervisor. Thus the subordinate job- satisfaction 
score of second-level supervisors consisted of the 
mean job satisfaction of the head nurses and staff 
nurses who reported to them. Likewise the subordi- 
nate job-satisfaction score of a first-level supervisor 
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subordinates as a group. The leadership power rating 
consisted of nine items. Five items measured the su- 
pervisor’s “expert power” in doing the work, for ex- 
ample, how well the supervisor knows the work rela- 
tive to her subordinates, the amount of knowledge 
and skill the supervisor has, and her ability to give 
explicit and detailed guidance to subordinates. Four 
items measured the supervisor's “delegated power,” for 
example, her degree of authority to reward or punish 
subordinates, her influence with superiors in having 


recommendations accepted, and her power to set 


TABLE 2 


CORRELATIONS BETWEEN LEADERS’ Least PRE- 
FERRED CO-WORKER SCORE AND RATED 
LEADER POWER AT Two LEVELS 
OF SUPERVISION 


Sraxtey M. Nearey anp Micron R. Bioop 


level These two scores have been com. 
bined as a measure of leader power in some of cer 
analyses. As shall be seen later, this rating of the 
leader's power was highly correlated with the ratings 
of work-group performance in the present samples, 
The measure of leader’s esteem for subordinates con- 
sisted of 10 evaluative bipolar adjective scales * plus 


One 
ANOVAs were performed using the leader's LPC 
score and the leader's esteem for subordinates as inde- 
pendent variables. Another set of ANOVAs involved 
the leader-behavior variables of Consideration and 
Initiating Structure as independent variables. These 
ANOVAs produced a number of significant main ef- 
fects at both supervisory levels, but significant inter- 
actions occurred only at the first level of supervision. 
ANOVA results will be reported only for these sig- 
nificant interactions. In other words, correlations will 
be reported rather than ANOVA main effects. 


RESULTS 
LPC Scores as Predictors 


The correlations between the leadership 
style measure (LPC) and the four ratings 
of performance by superiors are shown in 
Table 1. As can be seen the correlations differ 
in sign at the two supervisory levels. In gen- 
eral the high LPC (relations oriented) sec- 
ond-level supervisors received better ratings 
from superiors, while at the first level of 
supervision low LPC (task oriented) nurses 
were judged the better supervisors. However, 
none of the correlations at the first level of 
supervision quite reaches the two-tailed sig- 
nificance levels used for all significance tests 
throughout this paper. Column 3 of Table 1 
shows that the correlations between LPC and 
performance were significantly different at 
the two levels of supervision. 


These 10 items are identical to Fiedler's (1967, 


Appendix A) “group atmosphere” scale. The part- 


whole correlation between the sum of these 10 items 
and the sum of the total 15 items was .958 among 
the first-level supervisors. 


Ratings of the leadership power enjoyed by 
0 ors at both levels were made by su- 
perior: just as were the ratings of perform 
asce criteria. Raters were instructed to rate 
expert and delegated power of the super- 
herself, but to rate the whole work 
$ on the performance criteria. Despite 
different emphasis, the power and cri- 
Hon ratings turned out to be highly corre- 
d at both supervisory levels. The power 
lal score and the global rating of perform- 
ce were correlated .884 and 856 (p < 01 
both cases) at the first and second levels, 
pectively, of supervision. Therefore, the 
er ratings are treated as another indica- 
tion of the excellence of job performance as 
n by superiors. Table 2 shows the corre- 
lations between leader LPC scores and rated 
leadership power at the two levels. Again 
high LPC (relation oriented) second-level 
nervisors receive good ratings, and again 
LPC predicts in different directions at the 
_ two levels. 

ire correlations between leadership style 
mie ) and the job satisfaction of subordi- 
hates are shown in Table 3. The LPC score 
ol supervisors did not correlate significantly 
Wich any area of subordinate job 

or with the sum of the five job-satisfaction 


_ Before moving on to the correlations be- 
tween leader behavior, as perceived by sub- 
= ordinates, and the criterion variables, the 
relations among the two types of leadership 
predictor variables should be reported. Table 
4 shows the correlations a 42 leader- 
Ship style measure (LPC) an g — 
S r dimensions Consideration * 
tiating Structure. None of the correlations in 
Table 4 approaches significance. Apparently 
these two types of predictor variables were 
quite independent. Readers who may have 
fallen into the habit of interpreting high LPC 
scores as indicating high Consideration and 
low LPC scores as indicating high Initiating 
Structure should take note of this finding. 


LBO Scores as Predictors 


Descriptions of leader behavior as per 
ceived by subordinates failed to predict rat- 
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TAB 
CORRELATIONS BETWEEN LEADER BEHAVIOR 


Sranrey M. NEALEY AND MILTON R. BLOOD 


LE 5 
AND SUPERIORS’ RATINGS OF PERFORMANCE 


AND Power at Two LEVELS or SUPERVISION 


First-level Second-level 
supervisors supervisors Across levels Within levels 
(N =22) (N=8) 
Superiors’ ratings 
Initiating | Consider- | Initiating | Consider- 
Structure} ation Structure] ation | p(i—r:) Cr PO pro 
71 72 75 74 
Physical patient care 107 131 -150 012 ns ns ns ns 
Information about 
patients 2⁴⁰ 313 -090 306 ns ns ns ns 
Human relations 010 -103 057 302 ns ns ns ns 
Global performance -413 077 232 —.015 ns ns <.058 ns 
Expert power .494* .162 —.144 -409 ns ns <.05 ns 
Delegated power 098 —.272 201 157 ns ns <.05 ns 
Total power 362 —.009 —.017 333 ns ns <.05 ns 


This significance level may seem surprising since r =.413, df 
tween r =.4 13 and r=,077 is significant because Consideration 
5 S. 05. 


While leader behavior as rated by subordi- 
nates was only slightly correlated with per- 
formance ratings by superiors, it was closely 
related to subordinate job satisfaction. Table 
6 shows the correlations between LBDQ 
Scores and job satisfaction as measured by 
the JDI. At the first level of supervision both 
Consideration and Initiating Structure are 
Positively related to all five aspects of sub- 
ordinates’ job satisfaction and to the job- 
satisfaction sum. All but two of these rela- 
tions are significant beyond the .05 level, 


TAB 


=20, is not significantly different from zero. The difference be- 


and Initiating Structure are correlated .704 in this sample. 


As Column 7 of Table 6 shows, Considera- 
tion was correlated with subordinate job sat- 
isfaction more highly than was Initiating 
Structure. Clearly, nursing assistants posi- 
tively valued both Consideration and Initiat- 
ing Structure behavior on the part of the 
first-level supervisor, but they valued the 
former more than the latter. At the second 
level of supervision the situation appeared 
somewhat different. Table 6 shows that Con- 
sideration was on the whole positively related 
to the job satisfaction of subordinates, but 


LE 6 


CORRELATIONS BETWEEN LEADER BEHAVIOR AND SUBORDINATE JOB SATISFACTION 


AT Two LEVELS 


OF SUPERVISION 


First-level Second-level | 
supervisors supervisors Across levels Within levels 
(N=22) (N=8) 
Job-satisfaction 
area pee 5 
Initiating | Consider- | Initiating | Consider- 
Structure ation Structure ation DCI ri) Cr bir) | Caro 
71 72 73 74 
Work .520* 762 087 -306 ns ns <.05 ns 
Supervision 557%* 790 | —.712* 820 | <.01 ns <.05 <.05 
People OIE pone 494 —.144 ns <.05 ns ns 
Pay 256 .524* —.537 .263 ns ns ns ns 
Promotion .348 622 —. 669 289 2.05 ns ns ns 
Total satisfaction .501* .809** —.170 .624 ns ns <.01 ns 


LEADERSHIP OF NURSING SUPERVISORS 


that Initiating Structure scores of second-level 
supervisors tended to be negatively related to 
the job satisfaction of their subordinates. The 
correlations between the leader-behavior di- 
mensions and satisfaction with supervision 
were not only significantly different from 
zero, in opposite directions, but were signifi- 
cantly different from one another (p < .05). 
In this situation it appeared the first-level 
supervisors wanted their own superior to 
evidence Consideration but not Initiating 
Structure behavior. This is in contrast to the 
first-level supervisor’s own subordinates, the 
nursing assistants, who valued both Consid- 
eration and Initiating Structure. 

The above findings involving LBDQ scores 
should be interpreted in the light of the rela- 
tions between the Consideration and Initiat- 
ing Structure dimensions themselves. Among 
second-level supervisors these two dimensions 
were correlated —.570, while at the first level 
of supervision they were correlated +.704, 
p< .01. These two dimensions of the LBDQ 
are, of course, designed to be orthogonal. The 
12 items in each scale were selected because 
of their orthogonal factor loadings based on 
an industrial population as reported by Stog- 
dill and Coons (1957, pp. 108-109). While 
Stogdill and Coons report (p. 110) that their 
dimension scores intercorrelated only —.02 
in an additional sample of 122 foremen, the 
results of this study indicate a much lower 
degree of factor independence. As a check the 
LBDQ scores from the total sample of 251 
Ss who had completed the LBDQ were factor 
analyzed. The principal axis factor analysis 
even without rotation strongly supported the 
two-dimensional factor structure and a priori 
item scoring of the present data. Nevertheless, 
the high intercorrelations between the Consid- 
eration and Initiating Structure item com- 
posites in the samples of this study and the 
fact that these intercorrelations differed in 
direction from one organization level to an- 
other should be kept in mind. The authors 
would further recommend that the factor in- 
dependence of the LBDQ dimensions be 
checked as a matter of course in future studies 
using this instrument, and that these checks 
be carried out separately for groups of Ss in 
Separate situations or experimental treat- 
ments. 
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Fic. 1. Significant ANOVA interaction at the first 
level of supervision. The leaders’ least preferred co- 
worker (LPC) score and esteem for subordinates 
interact to determine satisfaction with supervision. 


SATISFACTION WITH SUPERVISOR 


As previously noted, two-way analyses of 
variance were performed to see if the leader- 
ship predictor variables would interact signifi- 
cantly to yield differences on the criterion 
variables. For the most part they did not, but 
at the first level of supervision there occurred 
a set of significant interactions that are rather 
revealing. These analyses featured as inde- 
pendent variables the leadership style measure 
(LPC) and the leader’s esteem for her sub- 
ordinates (these two variables were corre- 
lated —.008, N = 21). 

The first-level leader’s LPC score and her 
esteem for her subordinates interacted to pro- 
duce differences in her subordinates’ satis- 
faction with supervision (F = 5.255, df= 
1/18, p < .05) and total job satisfaction (F 
= 6.992, df=1/18, p< .05). These inter- 
actions are charted in Figures 1 and 2. As 
can be seen, first-level supervisors with high 
LPC scores and high esteem for their sub- 
ordinates stand out as having highly dissatis- 
fied subordinates. This finding was partially 
explained by the fact that the same two inde- 
pendent variables (LPC and esteem for sub- 
ordinates) interacted to produce significant 
differences in the Consideration ratings of 
these first-level supervisors (F = 5.166, df= 
1/18, p < .05). This interaction is charted in 
Figure 3. Nurses with high LPC scores and 
high esteem for subordinates were rated by 
those subordinates as lowest in Consideration 
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Feo. 2, Significant ANOVA interaction at the 
first level of supervision. The leaders’ least preferred 
co-worker (LPC) score and esteem for subordinates 
interact to determine total job satisfaction. 


behavior. Although the interaction did not 
reach significance, this same group of nurses 
was also lowest in Initiating Structure be- 
havior. In other words, these high LPC first- 
level supervisors who like and trusted their 
subordinates were seen by them as generally 
“inactive” in leadership behavior. Their sub- 
ordinates in turn were highly dissatisfied with 
their supervision and with their total jobs. 
Relations b Criteri 


At each of the two supervisory levels the 
correlations were computed between the five 
aspects of job satisfaction of subordinates and 
the four ratings of performance made by 
superiors. None of the 20 correlations at 
either level reached statistical significance. 
It is interesting to note, however, that 18 out 
of 20 of the correlations at the first level of 
supervision are positive, while 13 out of 20 
are negative at the second level of supervision. 
A test of the frequency of plus and minus 
correlations at the two supervisory levels 
yielded * = 12.90, df=1, p< 001. It 
should be pointed out that the 40 correlations 
on which this x? is based are not independent 
since there are substantial positive correla- 
tions within the JDI subscores and within the 
performance rating areas. The resulting x? is 
thus difficult to interpret. Conservatism 
compels us to stop short of the tempting con- 
clusion that the role expectations sent from 
superior and subordinate are compatible in 
the case of the first-level supervisor but con- 
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FIRST-LEVEL SUPERVISORS 


HIGH 
ESTEEM FOR SUBORDINATES 


LOW 


Fic. J. Significant ANOVA interaction at the first 
level of supervision. The leaders’ least preferred co- 
worker (LPC) score and esteem for subordinates 
interact to determine Consideration. 


tradictory in the case of the second-level 
supervisor; and that here, contrary to the 
usual situation, the second-level supervisor 
is really the “man in the middle” caught be- 
tween conflicting demands from superiors and 
subordinates. 


DISCUSSION 


The present study was based on the single 
simple hypothesis that effective leadership 
style and leadership behavior patterns would 
differ across supervisory levels. This hypothe- 
sis was confirmed. In general, good perform- 
ance ratings by superiors went to high LPC 
(relations oriented) second-level supervisors 
and to low LPC (task oriented) first-level 
supervisors (see Tables 1 and 2). Taking 
subordinate job satisfaction as a leader goal, 
it was found that Consideration behavior on 

part of the leader was positively valued 
by subordinates at both supervisory levels- 
Initiating Structure behavior was also posi- 
tively valued by subordinates at the first level 
of supervision, but was negatively valued at 
the second level of supervision. 

A post hoc interpretation of the job-satis- 
faction findings comes readily to hand. The 
subordinates of second-level supervisors were 
RNs and shared the same professional qualifi- 
cations as their leader. They liked her to be 
considerate but they did not want her to 
structure their jobs for them. On the other 


(1967, Chapter II) recent 


s looked to them for both considera- 
and structure because they did not have 
professional qualifications to handle py 
patients alone. Their leader was 


jonally qualified and they looked to 
her for structure. This suggests that the 
unt of Initiating Structure behavior a 


gubordinate welcomes from a superior is 


a function of the difference between 
in the degree of task expertise they 
Since Initiating Structure and Considera- 


“Hon were highly positively correlated at the 


level of supervision one may think in 

ms of active“ leaders and “inactive” lead- 
ets at this level. The inactive first-level super- 
Yisors tended to be relations oriented (high 


LPC) and to have high esteem for their sub- 


ordinates. This finding fits well with Fiedler’s 
that 
high LPC leaders who feel satisfied with their 
interpersonal relations with group members 


“will sit back and take it easy; that is, they 


Will be relatively inactive in dealing with 
the group. The remarkable thing in this case 
however is that the high esteem these first- 

leaders felt for their subordinates was 
Negatively reciprocated. Since the subordi- 


nate's satisfaction with the leader and the 


Teader’s esteem for the subordinate are two 
Sides of the same relational interface one 
would expect them to be significantly posi- 
tively related. 

The present findings lend some confirma- 


‘tion to claims that different 


will place different demands on leaders. 
While the authors do not expect the particu- 
lar patterns discovered in the present study 
to generalize very far, they would expect dif- 
ferences in leadership demands to occur when- 
ever the leadership situation differs sharply 
from level to level. In particular, two aspects 
of the present sample should suggest cau- 
tion against generalizing these findings di- 
rectly to other populations. First, one must 
consider that a nursing service may be a 
special kind of organization. Second, moving 
up through the levels of a hierarchy, n be- 
comes smaller at each level. In this case, u at 
the second level dropped to 8. A correlation in 
such a small sample can be strongly influ- 
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RELATIONSHIP OF MEMORY OPTIMISM TO WORK 
COMPETENCY AND PERSONALITY VARIABLES 


H. MELTZER axe D. LUDWIG! 
Washington University 


on the job, having a higher level of autonomy, and rating 
personal competence. Workers who had high economic values were lest often 


The purpose of this study was to investi- 
two problems related to personality 
yariables—memory dynamics and work com- 
petency. In the area of memory dynamics the 
field is relatively barren as far as workers as 
Ss are concerned. Meltzer and Ludwig (1967) 
f that, although younger workers are sig- 
nificantly more often memory optimists than 
older workers, each age group expressed sig- 
Nificantly more pleasant memories than un- 
pleasant. These findings were interpreted to 
‘Suggest that personality factors rather than 
the age factor as such are influencing memory 
ics involved. In the only study previ- 

ously published on college students, 
(1952) found a relationship between person- 
_ ality factors and security feelings as mea- 
sured by the Maslow Security-Insecurity Test 
and discovered that features of childhood and 
adult memories alike were also shown to be 
significantly related to security scores. This 
he interpreted as an initial advance toward 
detailing personality and memory functions. 
An earlier investigator, Waldfogel (1948), on 
the basis of a lengthy study of the affective 


1 An earlier version of this article was presented 
at the meeting of the Midwestern Psychological As- 
Sociation, Chicago, May 1967. 


memory optimists, less competent on the job, and tended to be older 


was to investigate work competency to person- 
ality variables, there will be no review of the 
literature on job satisfaction and performance. 
Instead, interested readers may refer to the 
oft-quoted review by Brayfield and Crockett 
(1955), Triandis’s (1959) critique, and the 
comprehensive chapter devoted to this topic 
by Vroom (1964), particularly pages 181- 
187. One of the earlier studies concerned 
with personality variables rather than job 
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satisfaction is that of Georgopoulos (1957) in 
which he reported that workers who perceive 
that effective performance is a path to their 
goals are more productive. The stronger the 
need, the more will this relationship hold. In 
a study of blue-collar workers, Friedlander 
(1966) found no difference between high and 
low performance. He found that low perform- 
ers found recognition through advancement to 
be more important and that high performers 
attached more importance to opportunities 
for self-actualization in white-collar govern- 
ment workers, but he found no significant dif- 
ferences in blue-collar workers. Guion and 
Gottier (1965) found very little indication of 
a significant relationship between personality 
test results and prediction of performance. In 
this present study an attempt was made to 
investigate work competency as rated by in- 
terviewers with two personality variables, 
autonomy and interpersonal competence as 
well as memory dynamics. Also explored was 
the relationship of personified values to mem- 
ory optimism and work competency. In the 
light of the foregoing studies, as well as the 
authors’ previous study, three hypotheses are 
made. These can be expressed as follows: 

1. Memory optimists will generally be more 
adequate as workers, more autonomous, and 
will rate higher in interpersonal competence. 

2. People who rated as more competent 
workers will tend to be memory optimists, 
more autonomous, rate higher in interpersonal 
competence, and typically will be younger. 

3. Workers who are more economic minded 
as compared to those who are more political 
minded (in the Allport-Vernon-Lindzey sense) 
will less often be memory optimists, rate less 
competent on the job, and will tend to be 
older, 

Since the nature of the study will be more 
clearly understood in the light of knowledge 
of the source material and Ss used, as well 
as the industrial setting of the community in 
which the study was made, next will be pre- 
sented the method and procedures where this 
material is described. 


METHOD 
Subjects and Setting 


The Ss used in this study were 143 workers in a 
paper converting industry located in a stable college 


H. MELTZER AND D. Lupwic 


community in upstate New York. This paper mill 
is the only plant of any size in the community and 
obtaining a job there at that time was considered 
by a good many of the people living in the com- 
munity as the beginning of establishment. For the 
most part the workers were semiskilled and skilled, 
with large families and strong feelings about their 
positions with the company, jealousy over promo- 
tions of others, and pride in their own advance, 
The workers ranged in age from 19 to 78 and repre- 
sented about one-third of all of the workers in the 
plant. The 143 were selected from the larger group 
because they were the only ones interviewed for 
whom data concerning memories of pleasant and 
unpleasant experiences as well as personified values 
were available. 

The source material used was based on initial 
interviews with workers who were taken over by the 
new company. The interviewers were representatives 
of the new company and the attitudes toward them 
were extremely favorable because of dissatisfaction 
with the old company. The interviewers, in addition 
to the senior author, were graduates of Washington 
University with a minimum of 4 yr. of interviewing 
experience. The interview was structured in the 
following sequence (Meltzer, 1950): 


1. Directed questioning about work, family, per- 
sonal story, and present conditions or problems. 

2. Recall of the most outstanding pleasant experi- 
ence with question, “And now tell me what comes 
to your mind when you think of the most outstand- 
ing pleasant experience of your life.” 

3. Recall of other outstanding pleasant experiences. 

4. Recall of the most outstanding unpleasant 
experience. 

5. Recall of other outstanding unpleasant experi- 
ences. 

6. Questions concerning personified values. 

(a) “Who is the greatest person that ever 
lived? Why ?” 
(b) “Who is the greatest person living? Why?” 

J. Questions concerning personified ideal or aspira- 
tion: “Of all the people you've ever seen, heard about 


or read about, whom would you rather be like? 
Why?” 


Memory optimism was determined by predomi- 
nance of pleasantness over unpleasantness in the 
experiences given by the worker, memory pessimism 
was determined by the predominance of unpleasant- 
ness over pleasantness, and indifferentism by an equal 
number of responses, 

Work competency was based on the rating by the 
interviewers at the end of the interview. The inter- 
viewers had available for use the ratings of the 
respective supervisors. Scores for various ratings 
were as follows: 1 for D; 2, D+; 3, C; 4, C+; 5, Bi 
6, B+; 7, A; 8, A+, 

Autonomy was determined on the basis of per- 
sonal and work histories given by the worker in the 
interview, and a content analysis, The worker was 
given a plus (+) in autonomy if the histories 
contained statements indicating adequate security and 
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adjustment; minus (—) indicated insecurity and mal- 
adjustment. A zero (0) was given if no clearer assess- 
ment could be made. A second judge made inde- 
pendent ratings of the same factors with the resulting 
correlation of .6 between the two. (A 3X3 con- 
tingency table was used with the maximum cor- 
relation being .82.) 

Interpersonal competency was also determined by 
a content analysis. A worker was given a + in 
interpersonal competence if the histories contained 
statements indicating that the worker got along well 
with others; if there were statements indicating a 
history of poor relations with others, a — was 
given; and a O was given if no clearer judgment 
could be made. The rating of the second judge cor- 
related .64 in this measure. (A 3X3 contingency 
table was used with the maximum correlation 
being .82.) 

Personified values were based on reactions to 
personifications of greatest ever lived, greatest living, 
and model.2 The predominant value was assigned to 
each S, and of the seven values assigned only the 
four with sufficient frequency are reported. The 
ratings of the second judge correlated .68 for the 
two most frequent values. 


RESULTS 


Relationship of Memory Optimism to 
Autonomy and Interpersonal 
Competence 


The workers were divided into two groups 
according to the memory optimists index; 
those with an index of +2 and above were 
put in the upper group (more optimistic). 
Those with an index of +1 and below were 
put in a lower group (less optimistic). The 
same procedure was followed in dividing the 
group for autonomy. Those who rated + were 
considered in the upper group here considered 
and those who rated — were put in the lower 
group. Those who scored 0 were not con- 
sidered. (Since the middle group was there- 
fore excluded in the calculations of the chi- 
square values for both the autonomy and 
interpersonal adequacy ratings, the total N 
is somewhat less than 143 in these cases.) 

Table 1 gives a 2 * 2 comparison between 
the optimism and autonomy of the workers. 
The chi-square value of 10.97 ( < 001) and 
the corresponding phi coefficient of .35 indi- 
cate that those who have greater memory 
optimism also tend to be more autonomous or 
tend to be more secure and well adjusted. 

2 For details of procedure in the use of projective 


i tee of which this is a phase, see Meltzer 
1950), 
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TABLE 1 
ReLaTionswir OF Memory OrTiaism TO 
PERSONALITY VARIABLES 
| Group | 
> Optimism ú i | 
n . a 
Factor | group Upper Lower | * 
r, i 
| factor | factor 
n n 
Au 2 ppe 2 
tonomy Upper 24 1 35 lo or- 
Lower 18 39 
Interpersonal Upper 27 19 92 60 
competency’ Lower 34 3i oe 
N = 93. 
bN = ill. 
*p < 001. 


The workers were divided into two groups 
according to interpersonal competency. Those 
who rated + were placed in the upper group 
and those who rated — were placed in the 
lower group. A 2 X 2 comparison of memory 
optimism and interpersonal competency is 
given in Table 1, The nonsignificant chi- 
square value of .66 and the correspond- 
ing phi coefficient of .02 must be inter- 
preted as failing to support any relationship 
between memory optimism and interpersonal 
competence. 


Relationship of Work Competency to 
Autonomy, Interpersonal Competency, 
and Age 


To compare these variables the workers 
were divided into upper and lower groups for 
each factor. For the work competency vari- 
able, those with a C+ and above were put 
in the upper group, and those with C and 
below were put into the lower group. For the 
age variable, those above 40 were put into 
the upper group and those below 40 were put 
into the lower group. The divisions for the 
other factors are the same as given in Table 1. 
The comparison of all of these variables was 
made in a 2 X 2 fashion and the results are 
given in Table 2. Chi-square values were all 
significant and show that the more competent 
workers also tend to be more often memory 
optimists ($ = 19, p < .05), tend to be more 
autonomous (¢ = .40, p < .001), tend to be 
more competent in interpersonal relations 
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($ = 23, p < .01), and tend to be younger 
($ = 20, p < 01). 

All in all, what the results reported in 
Table 2 indicate is that the more competent 
people rated higher in performance (work 
competence), also were more autonomous, 
rated higher in interpersonal competence, 
were younger, and were more likely to be 
memory optimists. 


Memory Optimism and Personified Values 


After the group of workers was rated ac- 
cording to dominant personified values in 
their lives, it was found that economic and 
political values classified nearly all of the 
workers. With one exception, then, the sta- 
tistical treatment of the data employed these 
two divisions. The same divisions into upper 
and lower groups were made for the other 
variables as were given in the previous tables. 
The comparison of these variables was also 
made in a 2 X 2 fashion and the results are 
given in Table 3. This table shows that those 
workers who were classified as being more eco- 
nomic minded as compared to those who were 
more political minded were less often memory 
optinſists (not significantly so), were rated 
as less competent on the job (¢= .19, 
Y < .01), and tended to be older (& = 20, 


TABLE 2 


RELATIONSHIP OF WORK COMPETENCY TO 
PERSONALITY VARIABLES 


Work Group 
compe- 
Factor tency Upper Lower $ * 
group factor factor 
n n 
Autonomy® Upper | 32 17 
Lower 11 33 | -40 | 14.50" 
Interpersonal Upper | 36 25 
competency” | Lower 18 32 23 5.83% 
Ages Upper 32 42 
Lower 44 25 20 6.04* 
Memory Upper | 38 36 
optimism? Lower | 20 49 19 5.15* 
aN =93. 
BN =111. 
e N = 143, 
*p <05. 
* p < 001. 


H. MELTZER AND D. Lupwic 


TABLE 3 
MEMORY OPTIMISM AND PERSONIFIED VALUES 
= z = 
Group 
Factor Value Upper Lower $ * 
factor factor 
n n 
Optimism* | Economic 26 38 
Political 2 ** 
Work Economie 25 39 19 | 4.28" 
compe- Political 33 24 
tency* 
Age⸗ Economic 40 23 + 
Political | 25 | 33 204.65 
Age? Theoretical 1 5 = 
Rengon 6 | o 848.57 
aN =121. 
DN = 12. 
*p <05. 
* <.01. 


< .01). This table also shows that those 
few workers who were more religious minded 
as compared to those who showed more theo- 
retical interests tended to be older (p < .01). 
The phi coefficient for this comparison is .84 
and the small number of cases needs to be 
taken into account for interpretation of this 
figure. 


DISCUSSION 


The purpose of this paper was to study the 
relationship between memory dynamics, per- 
sonality, and performance variables in a field- 
study situation with a group of 143 workers 
as Ss. The memory dynamic under considera- 
tion was the degree of memory optimism, the 
performance variable was the competency 
rating of the worker, and the personality 
variables included autonomy, interpersonal 
adequacy, and the value system of the 
workers. The age factor was also considered. 

Looking first at memory optimism, a num- 
ber of factors seem to be correlated with the 
degree of such optimism. The workers more 
optimistic about their past experiences also 
tend to be judged as more adequate on the 
job and more autonomous than do the less 
optimistic workers. It would seem from this 
that the success on the job for the workers 


Memory Optimism 


of this community is a very important factor 
in their memory optimism—r at least the two 
go together. The role of this factory in the 
community would seem to bear this out, for 
getting a job at this factory is something 
looked forward to by many in the area. The 
relation between autonomy and optimism is 
also a logical one and would seem to indicate 
a positive outlook on the part of these work- 
ers, In other words, those who feel more 
secure and adjusted can view their past posi- 
tively and with less bitterness—thus showing 
memory optimism. This is in accord with 
Purcell’s (1952) finding that relative frequen- 
cies of joy and fear memories are significantly 
related to security feelings. 

Looking at the factors related to the rated 
job adequacy of the workers, some interesting 
relationships are also present. Those who are 
rated higher on job adequacy also tend to be 
more autonomous and interpersonally compe- 
tent, tend to be younger, and tend to be more 
politically than economically oriented. Aside 
from actual ability, which was held constant 
in this study, there would seem to be a syn- 
drome for the adequate worker. Good adjust- 
ment and the ability to get along well with 
others would certainly enhance the productiv- 
ity of the worker, as this and many other 
studies bring out. As far as the younger 
worker’s being more productive or more ade- 
quate, this would probably be a factor of the 
type of work required in this factory. The 
skills required to run the machines are not 
very complex, so there would not be an in- 
crease in job adequacy with more practice, 
whereas actual motor skills have been shown to 
decrease with age. The value system for the 
adequate worker is also an interesting finding. 
It would seem that the better worker is not 
working just for the money nor is he con- 
cerned primarily with increased pay (eco- 
nomic value), but is more interested in de- 
veloping his skills and looking forward to 
advancement (political or power value). Such 
a person would also tend to look for ways 
of improving production and gaining prestige 
in the eyes of his superiors rather than con- 
Centrating on or complaining about his salary. 
This work attitude would tend to favorably 
influence the competency rating of the worker. 

These findings remind one of statements 
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made by Dobzhansky (1967) in an article in 
Science on “Changing Man“: 


Optimists believe that ours is the best of all pos- 
sible worlds. And pessimists are those who fear that 
the optimists are right. This is a flippant, but valid 
statement of truth. Optimism is often a result of 
ignorance of cold and unwelcome realities. There is, 
however, another kind of optimism, which is pes- 
simism surmounted. The world is far from perfect, 
but it is not unalterable, I am tempted to call this 
evolutionary optimism [p. 409]. 


What he calls “evolutionary optimism,” 
Meltzer and Ludwig (1967) have called 
realistic optimism. They reported responses 
of some people who gave no unpleasant ex- 
periences whatever, but whose life stories in- 
dicate them to be eternal or wishful optimists 
(who have repressed unpleasantness) rather 
than realistic optimists. Realistic optimists 
face unpleasantness as well as pleasantness 
as realities they react to and profit from. It 
is not surprising, therefore, to find a positive 
relationship between memory optimism and 
work competence as well as work competence 
and personality variables. 

Looking specifically at the factors related 
to the value system of the workers, we have 
already seen that the political value is related 
to job adequacy. There are also significant 
differences in values according to the age of 
the workers. The older workers tend to be 
more economic and religious minded, whereas 
the younger workers tend to be more political 
and theoretical minded. Looking first at the 
older workers, it would seem that the threat 
of dependence that comes with the approach 
of retirement brings about a greater emphasis 
on the monetary value. Also the reality of 
death and the end of the existence in this 
world seems to bring about a greater emphasis 
on the religious values of life. The younger 
workers, with a life ahead of them, tend to 
look more for power and prestige, and with 
the threat of death more remote tend to 
concentrate on contemporary living rather 
than on religious values. Thus, in general, it 
would seem from this study that, by the 
use of a more clinical approach in the inter- 
viewing process, knowledge concerning the 
nature of memory optimists in relationship 
to performance ratings and personality vari- 
ables and knowledge concerning the differen- 
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tial characteristics of more competent workers 
are obtainable. It must be added, however, 
that such a “critical approach” does add 
elements of subjectivity to the study. 
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RELATIONSHIP OF KUDER PREFERENCE RECORD 
VERIFICATION SCORES TO ADJUSTMENT: 


IMPLICATIONS FOR VOCATIONAL DEVELOPMENT THEORY 


FRANK A. NUGENT 


Western Washington State College 


High school males with inappropriately low Verification (V) scores on the 
Kuder Vocational Preference Record (KPR) were compared to high school 
males with acceptable V scores on the California Psychological Inventory (CPI) 
to investigate whether personal conflicts might be contributing to low V scores 
Intellectual capacity and testing conditions were controlled. Incorrectly 
marked or incomplete inventories were eliminated. The final N was 256 boys 
with acceptable V scores and 83 with inappropriate ones. Students with 
acceptable V scores showed significantly higher mean scores on 15 of the 18 
CPI scales. Thus, the V score might be considered 1 index of personal and 
social maturity, Assumptions that vocational choice is an integral part of total 
personality development and that vocational maturity relates to total maturity 


received some support. 


The purpose of this investigation was to ex- 
plore the relationship of the Verification (V) 
score of the Kuder Vocational Preference 
Record (KPR) to adjustment. 

Kuder developed the V scale as a check on 
the confidence that can be given to a stu- 
dent’s answers on the interest inventory. In 
the KPR manual he discusses possible reasons 
for V scores which fall outside the range 
(38-44) which he has conceived of as ac- 
ceptable for meaningful interpretation of in- 
terest profiles (Kuder, 1957). Scores above 44 
he points out are usually a result of not 
understanding directions since 44 is the high- 
est possible score. The reasons given for 
scores below 38, however, are varied and 
less clear cut. The explanations for low devi- 
ance include not only the possibility of the 
student marking the answer sheet incorrectly, 
but also the possibilities of reading difficulty, 
disturbing test conditions, physical illness or 
emotional upset, careless or insincere re- 
sponses, or an atypical interest pattern. 

Students with V scores below the accept- 
able range, who were tested under adequate 
conditions, who were technically accurate in 
test taking, and who had sufficient intelligence 
to comprehend the vocabulary of the inven- 
tory, are the Ss of this study. The focus of 
this investigation then is whether emotional 
factors are contributing to the inappropriately 
low V scores, 


Support for the contention that a low V 
score is an indication of emotional conflict 
is limited and predominantly conjectural. 
Kuder’s speculations about the relationship 
of the low V score to emotional factors have 
been echoed by some counselors operating on 
clinical hunches and observed behavior of 
counselors, but no studies relating personality 
characteristics or adjustment process to V 
scores of the Kuder profile are in the litera- 
ture. 

However, related assumptions that voca- 
tional choice is an implementation of the 
self-concept and an integral part of total 
personality development and that vocational 
maturity relates to total maturity have been 
important concepts of vocational development 
theory during the past decade (Super, 1957). 
If these assumptions are valid, then it is 
conceivable that high school boys who could 
not meaningfully respond to the choices of 
the KPR might be reflecting confusions of 
self-image due to distorted perceptions of 
themselves. Also, it is possible that conflicts 
with others significant in the environment 
might have led to inner tensions which ren- 
dered them less personally or socially mature 
than the average high school boy. In these 
contexts the V score of the Kuder might be 
considered one index of personal and social 


maturity. A ent 
Based upon Kuder’s conjectures, the clini- 
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cal hunches of counselors, and the assump- 
tions of vocational development theory, the 
following hypothesis was formulated: High 
school boys with V scores on the Kuder 
Vocational Preference Record within the ac- 
ceptable range (V-AR) will show more favor- 
able scores on a personality inventory than 
will high school boys with V scores not in the 
acceptable range (V-NAR). 


METHOD 


Kuder Vocational Preference Records (KPR) and 
California Psychological Inventories (CPI) were 
given in a 4-yr. high school to all males except those 
in special education classes. The town has a popu- 
lation of approximately 10,000 and is situated in 
the northwest region of the United States. The in- 
vestigator supervised all the test taking to assure 
optimal testing conditions. 

After eliminating individuals with IQs below 90, 
with incorrectly taken inventories, and with dis- 
simulated CPI profiles (Gough, 1964), the final N 
was 256 boys with V-AR scores and 83 with V-NAR 
scores. 

Mean scores on each of the 18 subscales of the 
CPI were obtained for both groups, and £ tests 
for significance of mean differences were obtained. 


RESULTS 


In Table 1 the frequency distribution of 
these scores is shown. As can be seen from 
Table 2, high school boys with V-AR scores 
had significantly higher mean scores on 15 
of the CPI subscales when compared to high 
school boys with V-NAR scores. The hy- 
pothesis was given considerable support. 

Generally, on the basis of these signifi- 
cantly different mean scores, individuals with 
V-AR scores when compared to individuals 


TABLE 1 
FREQUENCY DISTRIBUTION OF V-NAR KPR Scores 


Score 


16-17 
18-19 
20-21 
22-23 
24-25 
26-27 
28-29 
30-31 
32-33 10 
34-35 15 
36-37 32 
Total 83 


S =S 


Frank A. NUGENT 


TABLE 2 


COMPARISON OF SCORES ON THE CPI BETWEEN Hian 
Scnoot Boys witu VAR Kuper Scores AND 
THOSE WITH V-NAR Kuper Scores 


Kuder 
results 


CPI 


scores SD s 


Scale 


Measures of poise, ascendancy, and self-assurance 


Dominance V-AR 45.42 | 12.45 | —2.91** 
V-NAR | 40.94 | 11.19 

Capacity V-AR | 42.06 | 10.63 | —3.39* 
for Status V-NAR | 37.41 | 11.48 

Sociability V-AR | 47.29 | 10.57 | —4.07* 
V-NAR | 41.94 | 9.83 

Social Presence V-AR 51.78 | 10.74 | —1.48 
V-NAR | 49.76 | 11.00 

Self-Acceptance |V-AR | 52.52 | 10.56 | —1.83* 
V-NAR | 50.16 | 9.20 

Sense of VAR | 40.69 | 13.28 | —5.78** 
Well-Being V-NAR | 30.87 | 13.90 


Measures of socialization, maturity, and responsibility 


Responsibility V-AR | 43.78 | 10.49 | —7.36** 
V-NAR | 33.37 | 11.86 

Socialization V-AR 50.94 | 10.49 | —7.51** 
V-NAR | 40.46 | 12.57 

Self-Control V-AR 41.08 | 10.73 | —3.02** 
V-NAR | 36.96 | 10.81 

Tolerance V-AR | 41.73 | 11.48 | —5.57** 
V-NAR | 33.86 | 10.20 

Good Impression | V-AR | 39.83 | 9.35 | —2.07* 
V-NAR | 37.33 | 10.03 

Communality V-AR 50.96 | 11.32 — 7.55“ 
V-NAR | 38.30 | 17.89 


Measures of achievement potential and 
intellectual efficiency 


Achievement via VAR | 42.08 | 11.40 | —6.53** 
Conformance V-NAR | 32.60 | 11.71 
Achievement via VAR | 45.83 | 10.73 | —3.95** 
Independence | V-NAR | 40.54 | 10.10 
Intellectual VAR 42.83 | 11.91 | —7.55"* 
Efficiency V-NAR | 31.05 | 13.55 


Measures of intellectual and interest modes 


Psychological. | VAR | 48.82 | 9.90 2.68“ 
Mindedness V-NAR | 45.49 | 9.56 
Flexibility VAR | 52.03 10.18 84 
VNAR 52.72 | 10.15 
Femininity V-AR 48.17 9.22] -1.31 
V-NAR | 46.70 7.71 


able 


Note.—V-NAR = Verification scores not within accepta nie 


range; N = 83; V-AR = Verification scores within accept 
range, N = 256, : 
27 Doe iled test was used since prediction was directio 
* < 101. 


nal, 


KUDER PREFERENCE RECORD AND ADJUSTMENT 


with V-NAR scores tend to be more self- 
confident, self-assured, and self-accepting 
(measures of poise, ascendancy, and self-as- 
surance); more responsible and more emo- 
tionally and socially mature (measures of 
socialization, maturity, and responsibility); 
more adequately motivated, intellectually 
efficient, and resourceful (measures of 
achievement potential and intellectual effi- 
ciency); and more appropriately responsive 
to the demands and needs of others (Psycho- 
logical-Mindedness) . 


DISCUSSION 


Certain conjectures arise concerning stu- 
dents with V-NAR scores on the KPR when 
considered within the context of vocational 
development theory. Perhaps some of these 
students were subtly rebelling against the 
choice making demanded by the self-inven- 
tory as part of a general negative reaction or 
apathy to what they consider unreasonable or 
unattainable expectations from the environ- 
ment. Possibly with some students an impasse 
in decision making occurred because of unre- 
solved conflicts between their own inclinations 
and contradictory demands of parents and 
other authority figures. Such conflicts could 
contribute to self-doubts and confused self- 
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perceptions with concomitant immobilization 
in making choices. 

One further speculation is that some of 
these socially immature and personally in- 
effective students might not have had the op- 
portunity to explore their interests or the 
world of work. In these situations, lack of 
experience or naiveté could be contributing to 
low V-NAR scores. 

Counselors should be alert to the possibility 
that for some individuals low V-NAR Kuder 
scores may indicate pervasive personal prob- 
lems of which difficulty of vocational interest 
choice is a part, If the broader areas of con- 
flict are resolved, these individuals might 
then be able to evaluate their interests more 
realistically or might feel freer to system- 
atically explore occupational fields consistent 
with their interests and abilities. 
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An instrument was developed to measure managerial attitudes and behavior in 
communication-type interaction episodes. This self-recording form produced 
data on both managers’ interaction activities during their working day and on 
how they felt about each episode that took place. The results showed that the 
managers studied were more favorable to self-initiated interactions than to 
other-initiated interactions. The data also showed that managers tended to 
evaluate the interactions they had with their superiors more positively than 
those that they had with their subordinates. The implications of these findings 
for communications within organizations were discussed, and the usefulness of 
the self-recording instrument for future research studies was considered. 


There have been an increasing number of 
studies during the last 10 years that have 
focused upon managers and the managerial 
job (see, e.g., Vroom’s 1965 review). Yet only 
a relatively small proportion of these investi- 
gations have studied the actual behavior of 
managers on their jobs. As Dubin (1962) has 
pointed out, “It is one of the curiosities of 
the literature on managerial behavior that 
we know almost nothing about it in any de- 
tail [p. 12].” The few studies that have 
been made of managerial behavior (e.g. 
Burns, 1954; Carlson, 1951; Davis, 1953; 
Dubin & Spray, 1964; Martin, 1959; Shartle, 
1956) do provide some information about 
what managers actually do on their jobs, but 
they fail to enhance our knowledge of the psy- 
chological dimensions of such behavior. That 
is, they supply little or no data on why man- 
agers behave in a certain manner or on how 
managers react to different behavior episodes. 

The main purpose of the present study was 
to explore the possibility of gathering data 
that will link certain kinds of managerial at- 
titudes to specific behavioral episodes. In 
short, it was expected that if attitudes can be 
more closely linked to behavior then both 
the attitude data and the behavior data will 
become more meaningful. 

Previous studies of managerial behavior 
have shown that managers spend a high pro- 
portion of their time talking to other people. 
Burns (1954) reports, for example, that man- 


agers spend 80% of all their time at work 
talking to others. This suggests that the study 
of the interaction episodes which take place 
during a manager’s workday may be a fruitful 
place to begin studying some of the sub 
stantive issues concerned with managers’ job 
attitudes. Furthermore, the study of these 
episodes also should contribute to an under- 
standing of the communication process in of 
ganizations, particularly with respect to the 
questions of why certain communication pat- 
terns exist and why communication problems 
develop. 

There are at least two important dimen- 
sions to consider in studying managers’ reat 
tions to interaction episodes. The first of 
these has to do with who initiates the inter 
action. Psychologically, there would seem t0 
be a significant difference between a situation 
where a manager initiates an interaction 
one where he has the interaction initiated for 
him by someone else. In the former situation 
the manager is likely to be in control of the 
situation and the interaction is likely to be 
held to satisfy his needs, In the latter case 
just the opposite may be expected to occu, 
and the manager is likely to be in a situation 
where he has less control and where the meet 
ing is designed to fit someone else’s needs. Tf; 
as is suggested by the research on s$ 
groups (Collins & Guetzkow, 1964), people 
generally feel more positively about those ift 
teractions that satisfy their needs and in 
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which they exercise control, then self-initiated 

teractions should be more positively evalu- 

‘Wed than other-initiated interactions. 

_ A second relevant dimension in determining 
a manager's reaction to an interaction 
Kencerns the position level of the other per- 
wns involved in the interaction. However, it 
Is not immediately obvious as to whether an 
individual should typically feel most posi- 
tively about an interaction with his subordi- 
Mates or with his superiors, Arguments can be 
advanced for either prediction. When he talks 
fo a subordinate, a manager is clearly in a 
higher status and higher power position, and 

“this could put him in control of the situation 
and thereby create conditions that may lead 
him to feel positively about the interaction. 
A study by Hurwitz, Zander, and Hymovitch 
{1953), however, indicates that high-status 
and high-power individuals may tend to feel 

less positively about interactions than will 

Tow-status and low-power individuals. This 

“study found that, after interactions, low- 
power individuals liked high-power individ- 
wals better than high-power individuals liked 
lower-power individuals. This suggests that 
superiors interacting with their subordinates 
Might evaluate the episode less positively 
‘than would the subordinates, despite the fact 
that the former have more control in the 
Situation. 

In summary, the purpose of this study was 
both to develop and test an instrument for 
Measuring managers’ reactions to specific in- 
teraction behavior episodes, and to collect 
data that are intended to answer two substan- 
tive questions about managers’ reactions to 
these interaction episodes: Does the initiator 
of an interaction feel more positively about 
the episode than does the recipient? Does the 
Superior or the subordinate feel more posi- 
tively about an interaction episode between 
the two? 


METHOD 
Research Instrument 


Each respondent was asked to complete a number 
of self-recording forms. As can be seen from Figure 
1, the form asks for factual information about a 

vior episode as well as for the manager's atti- 
tudes toward the episode. The form itself is adapted 
one used earlier by Burns (1954). It asks for 
some of the same factual information (some of 
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nurse service, a girl scout council, and a city depart- 
ment of health. The 105 managers studied repre- 
sented all the managers employed by the social 
service organizations and a random sample of about 
40% of the lower and middle level managers em- 
ployed by the manufacturing firm. A comparison 
between the demographic characteristics of the sam- 
ple and the demographic characteristics of other 
samples of middle and lower level managers (eg., 
Porter, 1961, 1962) revealed good comparability with 
but one exception. Thirty-one of the managers jn 
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TO 
1. TYPE OF CONTACT 2. 


Letter or Memo 
Discussion in Group 
Discussion vith one 
individual 
Telephone 

Records or Reports 
Other 


3. TYPE OF ACTIVITY 4. 


Production/Operations 

Sales /Distribution 

personne l / Industrial 

Relations 

Public Relations/ 

Advertising 

General Administ ration 

Research and Development 
Engineering 
Finance/Accounting 
Other 


5. PURPOSE OF EPISODE 


Information or Advice 


Giving 
Receiving 


Creating 


6. ATTITUDE TOWARD EPISODE 
Valuable 
Dissatisfying 
Boring 
Precise 
Challenging 
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OTHERS INVOLVED. 


POSITION OTHERS INVOLVED 


Superiors 

Subordinates 

Peers 

Outsiders in Upper Management 
Outsiders in Middle Management 
Outsiders in Lower Management 
Other Outsiders 

None 


CONTACT INITIATED BY 


Self 
Other 


Instructions or Decisions 


Worthless 
Satisfying 
Interesting 
Vague 

Not Challenging 


Fic. 1. Copy of the Self-Recording Form. 


the present sample were females, while none of the 
other samples contained female managers. The fe- 
male managers all worked for the social service 
organizations. Because of this and because of certain 
organizational differences between the manufacturing 
firm and the social service organizations, it was de- 
cided to analyze much of the data from these groups 
separately in order to determine if comparable find- 
ings would appear in two rather diverse populations, 


Procedure 


Meetings with the managers in each organization 
were held at the beginning of the workday on Mon- 


day. All 105 managers who attended the meetings 
agreed to participate in the study. A researcher first 
gave each participant a copy of the instructions for 
completing the self-recording form and a sample 
form. He then went over the instructions carefully 
with the managers and pointed out that the research- 
ers were particularly interested in getting a form 
completed for each behavioral episode. 

Questions were then invited from the manages 
with respect to the instructions given, Most of the 
questions were concerned with when it was approptt' 
ate to fill out a self-recording form. A number 9 
specific incidents were discussed as examples. The re- 


continually stressed that be wanted to be 


igg the week in question. (It had been arbitrarily 
in advance that, because of thelr frequency 
ged highly specialized nature, managers’ contacts 
their secretaries would not normally be in- 
daded.) 
_ Each manager was then given 150 forms and five 
envelopes addressed to the researchers at 
Yale University. The managers were told that they 
were to complete the self-recording forms for 1 wk. 
ged that they were to begin as soon as the meeting 
ended. They were also told that at the end of cach 
day they should put the completed forms in an 
envelope and mail them directly to the University. 
Thus, cach manager was expected to mail five 
“groups of completed forms to the University, one for 
day of the study. This procedure was 
for several reasons. First, it was hoped that 
this would emphasize to the managers the im- 
portance of completing the forms immediately after 
the episodes occurred. It also provided the research- 
es with a record of when the episode took place. 
Finally, it served to emphasize that this was not a 
company sponsored project and that the company 


RESULTS 


A total of 4,390 completed self-recording 
forms were obtained. The mean number com- 
pleted (for 5 working days) by managers in 
the social service organization was 43.5, and 

the mean for managers in the manufacturing 

company was 39.9. Every manager completed 
at least 7 forms; the most that any manager 
completed was 112.2 Table 1 shows the 
frequency of the different types of contacts 
in the two samples. As can be seen from the 
table, the managers in both samples reported 
a large proportion of their episodes involved 
verbal interaction with others. Only 11% of 
the episodes of the managers in the manufac- 


1 The relatively wide range in the number of com- 
forms returned for the 5-day period raises 
the question of whether some respondents were 
neglecting to report some of their episodes. One 
way to check this, of course, would be to have 
rvers on hand in the work situation to record 
the factual occurrence of interaction episodes. This 
Was not feasible in the present investigation and 
ordinarily would not be feasible in most such investi- 
gations if any very large number of respondents were 
included in the sample. In the present study other 
available evidence—the fact that most managers’ 
forms accounted for their total time during the 
Working day—indicates that a high percentage of 
episodes were being reported. 
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TABLE ! 


Peacextace or Tyres or Contacts Rerosrto 
ZT Ee ee ů — — 


Type of contact | —— — service 
Groups * n 
Individual 4% i “ 
Telephone 14 = 
Letter 6 * 
Records 5 10 


organizations. 
Table 2 shows the percentage of these within- 
own-organization contacts that were with 
superiors, peers, and subordinates. The man- 
agers reported that they generally tend to 
interact most frequently with their subordi- 
nates. This is not surprising since there are 
undoubtedly more subordinates for these 
managers to interact with than there are su- 
periors. Burns’ (1954) data also show this 

tendency. 

Table 3 shows the percentage of the con- 
tacts that are initiated by self, both, and 
others. The data show a relatively even bal- 
ance between self and others, and they show 
that very few contacts are seen as being 
jointly initiated. Thus, in the analysis of the 
attitude effects of who initiates an interaction 
it was decided to focus on self- and other-ini- 
tiated contacts. Additional analyses of the 
data (not shown in the table) indicated that 
the episodes reported were about equally 
spread out during the day from 8 a.. to 4 


TABLE 2 


PERCENTAGE OF WITHIN-ORGANIZATION Contacts 
THAT INVOLVED SUPERIORS, PEERS, AND 
SUBORDINATES 


% manufactur- 


Contact ing company | % Social service 
Superiors 24.9 22.2 
Peers 43.2 32.2 
Subordinates 60.2 | 58.3 
— h —:q 


Note. These percentages add up to more than 100 because 
more than one ie of individual can be engaged in each 
interaction. 
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TABLE 3 


PERCENTAGE or Setr-, Orrs, AND Bomi- 
Inirtatep Contacts 


* P 

Initiator * — % social service 
Self | 55 50 
Both | 3 3 
47 


Other | a2 


P.M. (except for the 12:00-1:00 lunch hour 
period). 

The mean attitude scores toward all the 
episodes each manager reported were com- 
puted for him (Table 4). In this analysis the 
episodes were divided according to who ini- 
tiated the episode and who was involved in it. 
Then the mean of each attitude scale was ob- 
tained for all the episodes reported by a man- 
ager. Finally, the scores for each of the man- 
agers were combined, and the mean attitude 
scores for all participants were obtained. 
Thus, the data presented in Table 4 consist 
of means which were computed from the indi- 
vidual manager’s mean scores. Table 4 also 
reports a composite attitude toward each 
episode, computed by combining the data 
from the five attitude scales. Using the mean 
scores for each manager as the basic data in 
the study has the conservative effect of mak- 
ing Y (105) the number of managers rather 
than the number of episodes reported. It also 
allows the responses of each manager to be 
weighted equally regardless of how many 
forms the manager completed. 


TABLE 5 


Noumper or Manacers Woo Are More POSITIVE TOWARD SELF-INITIATED 
THAN TOWARD OtHER-IniTraTED INTERACTIONS 
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TABLE 4 


Mean Artirupe Scores ror Eact or THE 
Scares: INITIATOR AND LEVEL oF 
Person CONTACTED 


Attitude scale [Initiator 7 

Superior Peer 
| 

Valuable- Self 6.21 | 6.12 
worthless Other | 5.87 5.88 
Satisfying- Self 5.86 5.43 
dissatisfying Other | 5.31 | 5.46 

Interesting- Self 5.48 | 5.4 
boring Other 5.38 5.41 
Precise—vague Self 5.76 5.53 
Other 5.37 5.25 
Challenging-not | Self 4.79 4.90 
challenging Other | 4.99 4.75 
Composite for | Self 5.59 5.52 
five scales Other | 5.46 5.38 


Note.—Higher scores indicate more favorable 
Standard deviations for the means reported in this 
cally were close to a value of. 90. 


The data in Table 4 show, as expecte 
general tendency for self-initiated coni 
be more highly evaluated than other-init 
contacts. This tendency was tested for 
tical significance with a sign test (Tab 
For each manager a comparison was 
between his average ‘scores for his self-in 
ated and his other-initiated contacts. 
results in Table 5 confirm the fact tha 
both samples there is a general te 
for the managers to feel more 


Manufacturing Social service 1 

Kittens seal company organizations Tota 
I || ed AA ye a a E a AE 
Valuable-worthless 36 | 11 e e 0 
Satisfying-dissatisfying 31 17 2 .10 | 32 22 1 63 ah) 
Interesting—boring 27 19 4 28 24 3 55 43 
Precise-vague 20 16 5 -10 | 38 15 2 01 67 31 
Challenging-not challenging 21 26 3 26 | 27 z 47 53 
Composite for five scales 34 15 1 -OL 3320 2 e 35 
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TABLE 6 
Nuungr or Manacess ror Wou Tumie Artrrepe Scosgs asr Mose Poors 
ror InTexactions wrn Taxis Superior max wru Tueis Susosntxarss 
nn a_i SSS Se eee 
| Manufacturing Social service Total 
Attitude scale | * — mata 
| a DALEL = — i 
| + | - | 0] » + =j 0 $ + 0 r 
Valuable-worthless | 27 | 2 8 n | 18 ó os | % | 38 14 os 
Satisfying-dissatisfying | 28 | 20 7 nv | “| 7 os 57 u 14 os 
Interesting-boring 331% 6 | 0S | 2 | 16 | 6 | 10} 61 | 32 | . 
Precise vague | 29 18 8 | 21 “s so 2 is 
Challenging—not challenging | 31 17 7 10 | 27 | 47 | 6 | * M u as 
Composite for five scales | 36 | 13: | 6 | oO | a} 2] s| W 33 | ojm 
Note- Plus sign indicates more favorable response to interactions with superiors. Sign teets were two talied, 


episodes initiated for them by others. The 
only dimension that deviates slightly from 
this tendency is the challenging-not challeng- 
ing one. However, for the total sample, the 
overall efiect is highly significant as is shown 
by the results for the five-scale composite 
analysis. 

The data in Table 4 also showed that there 
is a general tendency for managers to feel 
more positively toward contacts that they 
have with their superiors than toward con- 
tacts they have with their subordinates. The 
peers’ contacts seem to fall between the su- 
perior and subordinate contacts. This ten- 
dency likewise was tested for significance with 
a sign test (Table 6). A comparison was 
made for each manager between his mean 
attitude toward all contacts he reported that 
involved his superiors and all those that in- 
volved his subordinates. As can be seen from 
Table 6, a significant majority of the mana- 
gers reported more favorable attitudes toward 
the contacts that they had with their superiors 
than the contacts they had with their sub- 
ordinates, Overall, this tendency appears to 
hold for both samples and for all of the atti- 
tudes studied. : > 

If the findings that self-initiated communi- 
cations and communications with superiors 
are evaluated highly are combined, this would 
Suggest that the most positive attitudes man- 
agers hold should be toward self-initiated con- 
tacts with their superiors. On the other hand, 
the least positive should be toward other- 
initiated contacts with their subordinates. The 


data in Table 4 showed that this was the 
case. There was a significant difference be- 
tween the mean composite score for self-ini- 
tiated contacts with superiors (7 = 5.59) and 
the mean composite score for other-initiated 
contacts with subordinates (F = 5.27) (¢= 
3.2, p < 01). 

Table 7 presents the composite attitude 
scores (the mean for the five scales) for all 
the “dyadic interactions” that could be found 
in the data. Here, self-recording forms were 
included only where it was clear that forms 
had been obtained from both a superior and 
his subordinate who were reporting on a com- 
mon episode. Thus, in these data, the re- 
ports of both people taking part in the same 
interaction could be compared. The previous 
data, of course, involved looking at the aver- 
age reaction of managers to many different 
kinds of contacts (i.e. groups, dyads, etc.) 


TABLE 7 


Mean ATTITUDE SCORES ror DYADIC INTERACTIONS: 
INITIATOR AND LEVEL OF OTHER PERSON CONTACTED 


Initiator 
Evaluator Grand Af 
Self Other 
Subordinate 5.78" 5.77 5.78 
Superior 5.71% 5.07% 5.42 
Grand M 5.74 5.45 


— —— ͤ üü—ͤ— 
Note. Higher scores indicate more favorable attitudes. 
an = 20. 
by = 24. 
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while the dyadic data allow for the compari- 
son of the attitudes of both participants to a 
specific interaction. As can be seen from Table 
7, the initiator of the interaction feels more 
positively toward it than does the recipient, 
and the subordinate in the interaction feels 
more positively toward it than does the su- 
perior. Sign tests were computed between the 
attitude responses of each pair of participants 
and these trends proved to be statistically sig- 
nificant ( < 05). These results, therefore, 
serve to confirm the results reported above 
for the larger analysis which involved all in- 
teraction episodes by the total sample of 
managers. 
Discussion 

Even though this study was essentially ex- 
ploratory, the results have both substantive 
and methodological implications. Relative to 
the former, they provide information for 
analyzing communications in organizations 
and for explaining certain job-satisfaction pat- 
terns that appear in organizations. From a 
methodological point of view, they are sug- 
gestive of the fact that meaningful data about 
attitudes toward specific behavior episodes 
can be obtained by the use of a self-recording 
form of the type employed in this study. 

The finding that managers tend to evaluate 
episodes they initiate more highly than epi- 
sodes initiated for them was anticipated. It 
would seem to follow from three factors. 
First, the initiator of any contact usually has 
the most control in the situation. Second, the 
initiator normally does not institute a contact 
unless he feels that a worthwhile issue that is 
important to him needs to be discussed. Third, 
the initiator normally should feel more re- 
sponsible for its taking place and might, 
therefore, be less likely to describe it nega- 
tively. 

The finding that managers tend to evaluate 
contacts with their superiors more highly than 
contacts with their subordinates is under- 
standable, even though it was specifically 
predicted. Part of the reason for this finding 
appears to be the fact that managers have 
relatively fewer contacts with their superiors 
compared to subordinates. Thus, a superior 
contact is relatively more “unusual” than a 
subordinate contact. In addition, the superior 
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has reward power over his subordinates. Thuy, 

a superior-subordinate interaction is likely 

to be a more significant event for a sub 
ordinate than for a superior. Apparently, as 

was suggested by the earlier study of Hur 

witz et al. (1953), this factor is significant 

enough to overcome any tendency that supe 

riors might have to evaluate the interaction 

more positively because of their superior com 

trol and power in the situation. 

This finding also fits with some of the prob 
lems that are often discussed where superior 
subordinate communication is involved. Haire 
(1956) and others have pointed out that, 
when talking with their superiors, subordinates 
often tend to overinterpret what is being said 
because they place too much importance on 
everything that is said. On the other hand, 
superiors may tend to do the opposite. The 
data from the present study suggest that this 
is in fact a real problem and that superiors 
do indeed tend to attach less significance to 
superior-subordinate interactions than do sub- 
ordinates. 

Given the results of the present study, it is 
not surprising that superiors often claim that 
they do not know what is going on in their 
organizations, because subordinates fail to 
communicate adequately. The results show 
that the least valued interactions are those 
where a superior evaluates an episode in 
which his subordinate comes to him for 4 
discussion. If, as seems likely, the superior 
communicates any of these reactions to his 
subordinates, then it will not be long before 
the subordinate will “learn” to reduce his 
initiation of these interactions. In effect, the 
subordinate is not likely to be reinforced (suf 
ficiently) for initiating communications a 
may very well decide that it is not worth com- 
ing to his boss with information, since his 
boss is not really interested. 

Turning to methodological implications, the 
data seem to show that the self-recording form 
approach used in the present study is a usefu 
way to measure managers’ behavior and thelr 
reactions to the behavior. As evidenced by the 
large number of forms that the managers 
were willing to complete, and by the results 
obtained, it appears that managers did take 
the study seriously and completed them con- 
scientiously. Many took the trouble to write 


Manacers’ Artrrunes v h Cosuwication Erisooes 


an important learning experience for them, 
since the forms helped give an overview of 
the days work. From a researcher's point of 
view, it was gratifying to find that the forms 
could be completed relatively quickly and 
easily and without seriously in 
distorting the day's typical work process. This 
approach also made it possible to measure the 
attitudes and behavior of a relatively large 
number of managers for at least a week's time. 
It would be difficult if not impossible to get 
comparable amounts of data by observation or 
any of the other commonly used approaches 
for gathering this type of behavioral data. 
This instrument could also play an im- 
portant role in developing what have been 
called “psychological” job descriptions, The 
typical job description provides a relatively 
poor behavioral description of the nature of a 
job. The self-recording form makes it pos- 
sible not only to collect better descriptive 
data but also to get at some of the feeling and 
attitude dimensions of the jobs. It can pin- 
point some of the psychologically difficult sit- 
uations and interactions in the job, and, most 
importantly, the collection of such forms 
across a large number and wide range of in- 
teractions facilitates the linking of such in- 
teractions to organizational factors. Future 
studies might also look at the behavior and 
attitude patterns of managers who tend to be 
described as being effective and ineffective 
performers. In addition, actual behavior pat- 
terns might be related to the overall job satis- 
faction of the managers. Thus, a significant 
aspect of the present exploratory study is that 
it demonstrates the utility of the self-recording 
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form and thereby suggests a sumber of future 
studies that can profitably be dome ia the 


realm of organizational psychology. 
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EMOTIONAL MEANING OF FOUR TYPOGRAPHICAL VARIAE 


ALBERT J. KASTL axo IRVIN L CHILD 
Yale University 


The influence of 


each mood, it was found 


dreamy, and tend 
sans-serif type; while moods 
to angular, bold, and perhaps serif type. 


Typefaces used in books or advertisements 
seem often to be chosen to emphasize or con- 
vey certain moods. This research was con- 
cerned with some of the physical attributes 
of type which apparently make mood com- 
munication possible. 

The fact that certain typefaces can convey 
moods has long been established (Berliner, 
1920; Burt, Cooper, & Martin, 1955; Davis 
& Smith, 1933; Ovink, 1938; Poffenberger & 
Franken, 1923; Schiller, 1935); among these 
earlier studies only those by Ovink and Davis 
and Smith make generalizations beyond indi- 
vidual typefaces. Ovink concluded that the 
light-bold dimension was primarily respon- 
sible for the emotional meaning of typefaces, 
Davis and Smith maintained that several ad- 
ditional factors were also responsible: size, 
condensation, slant (italic versus roman), and 
style of type (a complex group of factors dis- 
tinguishing one family of type from another, 
as the difference between Caslon and Chelten- 
ham type). Their pioneering experiment faced 
the complexity of the problem and represented 
a significant advance; yet it has several lim- 
itations: 

* The authors are indebted to Stephen Dwoskin for 
technical advice on typography and to John D. Car- 
roll for assistance in the statistical analysis. The in- 
vestigation was carried out during the tenure by 
Albert J. Kastl of a National Science Foundation 
predoctoral fellowship. A more detailed statement of 
results, a classified list of the typefaces, and sam- 
ples of their appearance have been deposited with the 
American Documentation Institute. Order Document 
No. 10004 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress, Washington, D. C. 20540. Remit in advance 


— $1.25 for photocopies or $1.25 for microfilm and 


make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


440 


4 typeface variables (angular versus curved, bold versus light, 
simple versus ornate, serif versus sans-serif) on judgments of emotional mean- 
ing was studied in 40 college students, In addition to specific conclusions for 
that, in general, moods such as sprightly, sparkling, 
to be matched to curved, light, ornate, and perhaps 
such as sad, dignified, and dramatic are matched 


(a) The categories employed are those tt 
ditionally used by typographers. It m y 
questioned whether these categories are 
ficiently distinct to ensure that layme 
spond to those features of type which d 
mine traditional categorization. 

(b) The results are of uncertain statist 
significance. i 

(c) Only 13 typefaces were used, so t 
the categories are represented very poor 
(At the extreme, only one type represents t 
entire category of expanded.) j 

A recent study by Tannenbaum, Jacobs0 
and Norris (1964) proceeds from critici 
similar to ours of earlier work, but is primar 
concerned with type characteristics differ 
from those we consider. We learned of t. 
work only after completing ours, and so 00 
not design the present study in relation 
theirs. Later, their findings will be co 
with our own. * 

We chose for examination the following d 
mensions: * 

A. Angular versus Curved. For simple lin 
an experiment long ago (Poffenberger & Bi 
rows, 1924) supported the expectation tl 
curves readily suggest quiet and gentlene 
angles, harshness and seriousness. We es 
a similar relationship with typefaces, 0 
because of this experimental finding and bi 
cause of the general use of this dimension 
visual art. 

B. Bold versus Light. This is the dime 
stressed by Ovink (1938). Also, artistic 
Suggests an important role for this variabl 

C. Simple versus Ornate. This typeface € 
mension seems not to have been studied 
Psychologists, but its general role in art (cf 
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Barry, 1957 and Fischer, 1961) suggested its 
inclusion here. 

D, Serif versus Sans-Serif. This dimension, 
too, has not yet been systematically studied 
for its possible emotional meaning. Serifs 
partly All the gaps between characters and tie 
up the bulk of the word in a horizontal direc- 
tion. An obvious hypothesis is that serifs will 
suggest forceful and stable moods. 

The limitations indicated earlier were met 
by (a) having Ss themselves judge the physi- 
cal attributes of the typefaces; (6) using ap- 
propriate techniques to evaluate the 
significance of the effects; (c) employing a 
relatively large number of types incorporated 
in a factorial design. Thirty-two types were 
used. They fall in two sets, each set com- 
prising variation in three of the four dimen- 
sions. Types 1-16 varied in Dimensions A, B, 
and C; Types 17-32 varied in Dimensions A, 
B, and D. In each instance the three dimen- 
sions were combined in an orthogonal design; 
each dimension had two levels, and each com- 
bination of conditions was represented by two 
different typefaces. Thus the same types con- 
stitute 32 samples of type for studying Di- 
mensions A and B, and 16 each for studying 
Dimensions C and D. 


MetHop 
Subjects 


The Ss were 40 students in introductory psychol- 
ogy, 20 men at Yale University and 11 men and 9 
women at The City College, New York (CCNY). 


Procedure 


Preliminary testing was done to insure the physical 
validity of the dimensions. Eleven judges were asked 
to categorize each of about 40 types we had selected 
for possible use, Approximately 20 were classified as 
angular or curved, bold or light, and simple or or- 
hate, and another 20 as angular or curved, bold or 
light, and serif or sans-serif. When disagreement ap- 
peared, the experimenters substituted new types un- 
til adequate agreement was reached (at least 9 of 
11 judges concurring with our initial categorization). 
Thirty-two typefaces, 16 of each group of 20, were 
any selected and made into ~ Each mags con- 

ed in a single typeface, and always in upper- 
case, the first 16 letters of the alphabet arranged in 
four rows and four columns. The letters were of 
fairly uniform height, but since shape varies between 
and within typefaces, width could not be - 
ized. The experimental procedure at Yale and CCNY 
Was essentially the same. Twenty Ss were divided 
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Types 1-16 were evaluated on the dimensions 
lar-curved, bold-light, simple-ornate; 17-32 on 
dimensions angular-curved, bold-light, serif-sans-seri 
Fourteen of the Yale Ss and 12 of the CCNY Ss 
participated in this part of the experiment. (Due to 
a procedural error, judgments of the Yale group on 
the simple-ornate and serif-sans-serif dimensions 
were not obtained.) 


=F 


RESULTS AND DISCUSSION 


lf Ss give different mood ratings to type- 
faces of various categories, are they respond- 
ing to the stimulus features intended to pro- 
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vide the basis for discrimination? An affirma- 
tive answer received advance support from 
the fact that preliminary judges agreed well 
with the experimenters’ initial categorization. 
More pertinent, however, would be direct evi- 
dence that Ss themselves are able to discrimi- 
nate on the intended basis. It was for this 
purpose that at the end of the experiment 
the Ss were asked to classify the typefaces. 
The results, in relation to what we intended, 
are indicated in Table 1. For example, as may 
be seen in the top line of the table, for 7 of 
the 16 types classified in advance as bold, 
over 95% of the Ss participating in this part 
of the experiment also classified them as bold; 
for 3 of the types classified in advance as 
light, over 95% of the Ss classified them as 
light; and so on. The other lines indicate the 
number of typefaces for which each lesser de- 
gree of agreement was found. Though agree- 
ment was not perfect, Ss taken as a whole 
differentiated consistently between types in- 
tended to be bold and those intended to be 
light, and between types intended to be serif 
and those intended to be sans-serif. On the 
other two dimensions, some individual types 
are not perfectly discriminated from all those 
intended to contrast with them; yet for each 
dimension the two sets of typefaces taken as 
a whole are very clearly differentiated. 

To illustrate the statistical analysis of the 
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mood ratings we will use the judgments made 
of the first 16 types for Adjective Group a. 
These 16 typefaces varied in Dimensions A, 
B, and C. The effects of these dimensions on 
Adjective Group a were studied in a five-way 
analysis of variance (A X B x C x ENS). 
(E represents the college variable, Yale or 
CCNY; and 8, individual Ss). In Table 2 each 
of the main effects and interactions was com- 
pared with a term measuring its variation 
among Ss in order to obtain an F ratio (e.g, F 
= A/AS). Thus for the data from the first 16 
types, an F was obtained for the A effect 
(23.86, p < .001), for the B effect (24.08, 
p < 001), and for the C effect (50.06, p< 
001). Similarly, for the second 16 types, an F 
was obtained for the A effect (25.03, p< 
001), for the B effect (48.86, p < .001), and 
for the D effect (8.13, ns). 

The large size of these effects deserves em- 
phasis. On Adjective Group a (sprightly), the 
8 bold types among the first 16 had an aver- 
age rating of 1.63, and the light types 2.41. 
An average rating of 1.82 was assigned to the 
angular types and 2.22 to the curved. For 
simple types the average was 1.57 and for or- 
nate types it was 2.47. These are large dif- 
ferences; though the scale went from 0 to 5, 
the mean of all Ss’ ratings on Mood a varied 
among typefaces only from .5 to 3.3, with a 
standard deviation of .78. 


TABLE 1 


CORRESPONDENCE BETWEEN SUBJECTS’ JUDGMENTS OF PHYSICAL QUALITIES or TYPEFACES 
AND THE EXPERIMENTERS’ PRIOR CLASSIFICATION 


No. typefaces yielding each percentage of agreement, 
for typefaces originally classified by Es as 


Percentage of Ss 
whose judgments 
agreed with Es’ 
prior classification Bold W 
100 7 3 3 
90 3 8 8 
80 3 2 3 
70 1 2 
60 1 1 
50 1 
40 1 
30 
20 1 
10 
0 


Curved | Simple | Ornate | Serif | Sans-Serif 
pike eae | ioe. | 

2 5 2 6 4 

6 2 3 2 3 
it 

3 

1 1 

2 

2 1 1 
i 


Note. V = 26 for each of the first 4 columns of data and 12 for each of the last 4, 


nearest multiple of 10. 


Percentages have been rounded to the 


MEANING or Tyroomariicar Vasisuss w 
TABLE 2 
F Tests Suowwo men Ivvtcence or me Tyrerace Vastastes (ao mrw 
Iyrezactioms) ox mwe Marcus of Mooo-Anjecrwe Sers 
— — — — ——— —— eee ee 
Mood.ad jective seta 
* | b | c | d | e | as à 
Single variables 
Angular-Curved (A)| 23.80%% | 39.78% | 27.51% now | ree | cas 
25.03% | 15.53" 179 783 | nIe | ase 
Bold-Light B) | 24.08" | 14.00" | 101.13" asos | isare | ass 
48.80% | 4430% | 110.30" 12.28" M | 125 
Simple-Ornate (C) 80 | 95.09%" | 6240% Lm | pa 
Serif-Sans-Serif (D) 9.60 18.46"" Ou 


AXB 


higher for angular, 


33 
— — 


Note. When results indicate that mood tends to be . 
1 rating tends to be e or sans-serif type, the F is in italics. 
> 8001. 


Table 3 provides a verbal summary of the 
ions and interactions which influence 
communication of each mood. The dimensions 
are represented in each instance by that end 
of the dimension which makes for communica- 
tion of the particular mood. For interactions, 
a dimension is represented by that end whose 
Presence enhances the main effect. Only those 
dimensions are included for which one of two 
Fs for the main effect was found significant 
at the .001 level (or both at the. OI level) or 
the single F significant at the .01 level. We 
will now supplement the table by summarizing 
another way, reviewing the main effects and 
the interactions of each of the typeface di- 
mensions. 
Curvilinearity clearly tends to convey the 
moods expressed by the five Adjective Sets 


a, b, e, d, and h (sprightly, sparkling, dreamy, 
calm, and soaring). Angularity, on the other 
hand, conveys the mood of Adjective Set f 
(dignified). Thus there is considerable agree- 
ment between the Poffenberger and Barrows 
(1924) study on the feeling tone of lines and 
our findings about the effect of angularity and 
curvature in typography. 

Light typefaces tended to be rated high by 
our Ss on the moods represented by Adjec- 
tive Sets a, b, c, and d (sprightly, sparkling, 
dreamy, and calm), Bold typefaces were rated 
high on the moods expressed by Adjective Sets 
e, f, and g (sad, dignified, and dramatic). 
These findings seem to agree well with the 
Ovink and Davis and Smith studies to the 
extent that we can properly match our moods 
with their somewhat different concepts. Ovink 
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TABLE 3 


TYPEFACE CHARACTERISTICS SIGNIFICANTLY MATCHED 
TO Sets OF Moop-ADJECTIVES, AND 
SIGNIFICANT INTERACTIONS 


Characteristics entering into 
Set of mood-adjectives | = = 


Main effects Interactions 


(light,* sans-serif*) 
ete curved) 
(bold) 


a*—sprightly, cheerful, 1. curved 
bright, vivacious, 2. light 
happy, gay. joyous 3. ornate 


i 1. curved (sans-serif*) 
ial, humor- 2. light (simple,* sans-serif) 
ous, whimsical, fanci- | 3, ornate (bold) f 
ful, quaint 4. sans-serif | (cur ved. * light) 
f 
c—dreamy, dainty, deli- «curved (light*) 


cate, light, graceful, 
lyrical, poetic, leisurely, 
gentle 


1 

2. light (simple. curved) 
3, ornate (beld) 

4. sans-serif | (curved) 


dh—calm, serene, sooth- « curved n 


1 
ing, tender, longing, 2. light simple, curved, serif) 
romantic, plaintive, 3. ornate (bold*) 
sentimental 
e—sad, pathetic, mourn- 1. bold (simple*) 
ful, melancholy, de- 
pressing, gloomy, 
heavy, tragic 
f—dignified, spiritual, 1. angular | (simple) 
solemn, sober, serious, | 2. bold (serif*) 


grave, earnest. lofty, 


philosophical 
dramatic, forceful, 1. bold (simple,* serif) 
vigorous, martial, 2. serif (curved,* bold*) 


ponderous, emphatic, 
majestic, exalting 


eur ved 
. ornate 


h—soaring, triumphant, 
elated, exciting, im- 
petuous, restless, stir- 
ring, spirited 


Re 


(angular) 


Note.—When_no asterisk appears in the Interactions 
column, the significant main effect (e.g., on the third line of the 
table, ornate) is separately significant at each end of the inter- 
acting variable (i.e., bold and light) at least at the .05 level, 
though more marked at the end named (i. e., bold). 

* The triple interaction for Mood Group a suggests that for 
serif type the effect of boldness is enhanced by angular type and 
for sans-serif type the effect of boldness is enhanced by curved 
type. However, for both serif and sans-serif type the effect of 
lightness is enhanced by curved type, 

The first triple interaction for Mood Group d suggests that 
the effect of brightness is enhanced in curved types for both 
simple and ornate types. For ornate type the effect of boldness 
is enhanced in curved types; for simple types there is a slight 
but quite insignificant trend suggesting that the effect of bright- 
ness is enhanced in curved type, for serif type only. The second 
triple interaction suggests that for serif type the effect of bold- 
ness is enhanced by angular type; for sans-serif type the effect 
of boldness is enhanced by curved type. 

* The main effect remains separately significant only for the 
end of the interacting variable named in the Interactions 
column; with the opposite level of the interacting variable 
the main effect of the first variable is not significant at even the 
.05 level, though it is still in the same direction. 

** The significant effect of sans-serif type is maintained only 
for curved type, For angular type the effect is significantly re- 
versed (p C. 05). 


noted that (1) the qualities of luxury and re- 
finement are related to low pressure (light 
type), (2) soberness and economy to rather 
high pressure (bold type), and (3) force and 
strength to high pressure (bold type). This 
relationship between adjectives and type cor- 
responds to our Adjective Sets c, f, and g, on 
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which we found light (c), bold (f), and bold 
(g) types, respectively, to be rated high. 
Davis and Smith noted that the qualities of 
delicateness and beauty were related to non- 
bold (medium) type, while strength was re- 
lated to bold type. This relationship corre- 
sponds to our Adjective Sets c and g, on which 
our Ss gave high ratings to light and bold 
types, respectively. 

Examination of the interaction of this di- 
mension with the others suggests that (a) 
significant main effects of boldness are sig- 
nificant only if the type is simple rather than 
ornate, and (b) significant main effects of 
lightness are generally enhanced if the type 
is simple rather than ornate. The fact that the 
main effects of both lightness and boldness are 
greater in simple than in ornate type suggests 
that the present results have some generality 
for standard typography, since simple type- 
faces are usually employed. 

Considering the simple-ornate dimension, 
we find that ornateness is judged to convey 
the moods of Adjective Sets a, b, c, d, and h 
(sprightly, sparkling, dreamy, calm, and soar- 
ing). There are no significant main effects in 
which simple type is rated higher on expres- 
sion of a mood. Examination of the interac- 
tions of this dimension with the others shows 
that although simple type shows no si gnificant 
main effects, it does enhance several main 
effects of other dimensions; ornateness never 
does this. This interaction indicates—to state 
more generally a point anticipated in the 
previous paragraph—that while simplicity by 
itself does not suggest a particular mood, it 
can render more conspicuous other distinctive 
features which do. Ornateness, on the other 
hand, apparently is so striking and distinctive 
in its emotional meaning that it diminishes 
the effects of all other typeface features simul- 
taneously present. 

Studying the serif-sans-serif dimension, we 
see that sans-serif type tends most strongly 
to suggest the moods of Adjective Sets b and 
c (sparkling and dreamy), and serif type the 
mood of Adjective Set g (dramatic). Although 
there are only these three significant main 
effects for this dimension, there are seven 
cases of significant interaction with main 
effects of other dimensions, For Adjective Sets 
a, b, and c (sprightly, sparkling, dreamy), 
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sans-serif type either enhances other main 
effects or is a significant main effect; for Sets 
d, f, and g (calm, dignified, dramatic), serif 
type either enhances the main effects or is 
itself a significant main effect. Comparing 
these two groupings of moods, we see that the 
first suggests freedom and high spirits; the 
second, restraint and seriousness. Apparently 
the serifs, by providing a horizontal linkage, 
facilitate communication of stability and em- 
phasis as was originally hypothesized, and 
their absence has an opposite effect. 

The main effects we have reported are not 
greatly limited by the interactions. In only 
two instances is a significant main effect of 
one variable reversed for one level of a second 
variable: For Adjective Group b (sparkling), 
the significant main effect of sans-serif is actu- 
ally found only if the type is curved; if it is 
angular, serif types are instead rated higher 
on this mood, and this tendency is itself sta- 
tistically significant (p < .05). For Adjective 
Group g (dramatic) the significant main effect 
of curved type obtained with the second 16 
typefaces is actually found only if the type 
has serifs; here a slight reversal not approach- 
ing significance appears if the type is sans- 
serif. 

When we look beyond these two exceptions, 
we find that the interactions do not cast doubt 
on the general validity of the main effects. 
For the 12 interactions marked with an as- 
terisk in Table 3, the main effect of one vari- 
able is separately significant (p < .05) at only 
one level of the second variable, yet at the 
other level the effect still runs in the same 
direction. For the 16 interactions without as- 
terisks in Table 3, the significant effect of the 
first variable remains separately significant 
(b < .05) for each level of the second, though 
the interaction shows the effect to be sig- 
nificantly larger for one level than for the 
other. Thus in a total of 28 cases of signifi- 
cant interaction the direction of the main 
effect is maintained for both levels of the 
second variable and in 2 it is not—evidence 
that the main effects are generally consistent 
in direction, A 

There were three significant triple inter- 
actions; only one of them (in Adjective Set 
d) affected the findings presented in Table 3. 
(This interaction indicates that only for serif 
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types is the effect of lightness enhanced by 
curved types.) The fact that 13 of 16 triple 
interactions were insignificant suggests that 
the main effects and first-order interactions 
are on the whole maintained from level to 
level of the third variable. 

Interactions of the college variable (Yale 
or CCNY) with the main effects and interac- 
tions presented in Table 3 were also com- 
puted. There were 88 such interactions; only 
4 of them are significant at the 5% level, and 
of these only 1 suggests a reversal of a sig- 
nificant effect (it does so by so slight a margin 
as not to justify attention in the context of 
general agreement). The general agreement 
between colleges indicates that the findings 
are generalizable to other similar groups. 

Some of these diverse findings can be or- 
ganized into a single general scheme. Adjec- 
tive Sets a, b, and c (and often d and h) form 
a group of moods which our Ss tend to find 
well expressed by light, ornate, and possibly 
sans-serif typefaces. On the other hand, Ad- 
jective Sets f and g (and often e) form another 
group on which instead the angular, bold, and 
possibly the serifed types are rated high. The 
a, b, and c grouping consists of those adjec- 
tives which express happiness, joviality, deli- 
cacy; the f and g grouping expresses dramatic, 
forceful, serious, and philosophical moods. 
Each grouping consists of neighboring sets on 
Hevner’s mood circle, in which adjacency sig- 
nifies similarity. With fair adequacy, the two 
groupings may be characterized as suggesting 
femininity versus masculinity in connotative 
meaning. Viewed in this way, curvilinearity, 
lightness, ornateness, and absence of serifs 
convey femininity, while their opposites convey 
masculinity. This is certainly a helpful gen- 
eralization for keeping the overall tenor of our 
results in mind, and the suggestion that sexual 
meaning is involved may be of considerable 
importance. (In another context, Jones and 
Lepson, 1967, have found black to be asso- 
ciated with masculinity, gray with feminin- 
ity.) A masculine—feminine polarity provides, 
however, only a most general conception of 
results that are really much more complex. 
For example, Adjective Set h (soaring) seems 
on the whole to have more masculine than 
feminine connotation, and yet our Ss rate high 
on it typefaces with characteristics we have 


446 


just designated as feminine (lightness, curvi- 
linearity, and ornateness). 

The relationships we find are probably 
brought about by many processes. As our 
study provides no basis for evaluating the 
relative importance of one and another proc- 
ess, we will not present here our speculations. 

Our results may be put together with those 
of Tannenbaum et al. (1964) to support their 
conclusion that “there is a substantial basis 
for the use of typography as a code for the 
communication of connotative variations p. 
72.” Specifically, Tannenbaum et al. provide 
evidence that connotative meaning is influ- 
enced by slant (italic versus roman), by case 
(lower versus upper), and by type family, The 
serif-sans-serif dimension entered their re- 
search post hoc, and apparently they could 
not confidently ascribe any influence to it, 
Our research supplements theirs, then, by 
demonstrating the influence of this dimension. 
In addition, we show the great influence of 
three dimensions with which they were not 
concerned: angular-curved, bold-light, and 
simple-ornate. Tannenbaum et al, on the 
other hand, make another special contribu- 
tion through comparing judgments by ama- 
teurs (comparable to our Ss) with judgments 
by professional typographers and by students 
of typography. In showing general agreement 
among these three kinds of judges, they dem- 
onstrate quite directly that the connotative 
meaning of typefaces can be utilized in com- 
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munication of intended meaning from artist 
to public. 
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This paper deals with the development of a Guttman-type scale of attitudes of 
southern industrial managers toward organized labor and the extent to which 
the results of such quantified statements differ from those of industrial managers 


outside the South. A 7-item scale was 


obtained. A comparison of the outcomes 


of these items with measures of attitudes of industrial managers outside the 
South indicates the presence of a number of similarities and differences in pat- 
terns of responses related to matters of the role of the union, the problem of 


union shop, collective bargaining, and 


Writers in the field of industrial relations 
have long been interested in studying the atti- 
tudes held by managerial personnel toward or- 
ganized labor. Attempts to develop quantified 
statements of these attitudes have been 
launched with varying degrees of success 
(Chalmers et al., 1953-54; Derber et al., 
1960; Stagner, Chalmers, & Derber, 1958). 
Most of these attempts have been made in 
large industrial corporations in highly indus- 
trialized regions and the measures devised re- 
flected this situation. No attempts have been 
made, on the other hand, to develop quanti- 
fied measurements of managerial attitudes in 
southern industrial centers.? As a first step in 
the direction of accumulating evidence on this 
region, this paper presents the findings drawn 
from a study of managerial attitudes toward 
organized labor in one southern industrial 
center. 

The experience of the industrial South with 
labor does not conform to many of the find- 
ings of industrial relations in other regions 
(Heberle, 1948; Simpson & Norsworthy, 
1965), At least three factors are responsible 


1 Revised version of a paper read at the annual 
meeting of the Southern Sociological Society, New 
Orleans, April 1966, The paper is based on data 
collected for a doctoral dissertation submitted by the 
senior author to the Department of Sociology, 
Louisiana State University, August 1965. 

? The southern region used in this paper corre- 
sponds to the Southeast of Odum (1936) in his 
Southern Regions of the United States. 


the process of decision making. 


for this: First, industrialization in the South 
began later than in other areas of the nation 
with significant consequences for management- 
labor relations; second, the social structure 
of the South, especially patterns of social 
stratification and the racial composition of 
the population strongly influenced the labor 
movement in the region (Heberle, 1959; 
Vance & Demerath, 1954); and third, the 
types of industries “creeping” southward pro- 
duced few craft, white-collar, and professional 
jobs. The result has been an overrepresenta- 
tion of the lower occupational strata (Simpson 
& Norsworthy, 1965). 

To these factors one can add the fact that 
organized labor is weaker in the South than 
in other regions of the country. Both the pro- 
portion of organized labor and the rates of 
unionization of southern workers are below 
those of the nation as a whole. In 1953 the 
proportion of organized labor in the South 
was 17.1% of the total nonfarm labor force 
(compared to 32.6% for the nation). Until 
1953 the average rate of unionization of the 
southern labor force was only 57% of the na- 
tional rate. Although the South increased its 
proportion of unionized labor in recent years, 
it is still considered to be underorganized 
(Roy, 1965). 

The effects of these characteristics upon 
labor-management relations have been the 
subject of a number of studies. Heberle 
(1948) has maintained that the spread of 
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unionization in the South will result in indus- 
trial conflict different from that experienced 
in the North largely because of the tradition- 
ally paternalistic pattern of labor relations in 
the South. Confining his observations to south- 
ern textile industry, Gillman has noted that 
common customs and mutual interests so 
characteristic of this industry have created 
harmonious and sympathetic management- 
employee reliations which may be classified 
as “folksy” (Gillman, 1956). Roy (1965) an- 
alyzed changes in labor management in the 
South in terms of several factors that include 
the growth in influence of absentee-top man- 
agement groups, the shift in the type of asso- 
ciation between management and labor from 
“shirt-sleeved informality toward impersonal 
business relationships,” and the replacement 
of early benevolent paternalism by pseudo- 
paternalism and coercion. 

The foregoing studies clearly indicate that 
employee-management relationships in the 
South are markedly different from those of 
other regions but have recently undergone 
significant changes. Thus earlier forms of 
these relationships were characterized by folk- 
siness, indigenous labor force, and a common 
culture, Benevolent paternalism provided 
these relationships with a strong shield against 
the encroachment of alien ideologies, thus pre- 
serving continuous harmony and cultural soli- 
darity. More recent forms of management- 
labor relations have emerged during the past 
few decades with rapid industrial growth and 
the diversification of products. Basic to these 
new patterns is the growth of absentee owner- 
ship and “the replacement of Uncle Ben with 
college-trained officials” (Roy, 1965). Man- 
agement in the South is now forced to deal 
with labor in terms of policies originating out- 
side the region. Also, the recent industrial 
expansion has brought about a new surge of 
labor efforts to unionize workers of the region 
which has heightened labor’s awareness of its 
self-interest and accelerated the breakdown 
of paternalism. 

Differences in patterns of labor-manage- 
ment relationships between the South and the 
rest of the country are subtle and require a 
great deal of systematic analysis. In addition, 
southern industrial centers are currently ex- 
periencing a variety of types of relationships 
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due to differences in level of industrial growth, 
type of product, and geographical location 
that it would be difficult to speak of an inclu- 
sive pattern. Preliminary attempts must be 
made, however, to reveal some aspects of these 
differences so that more comprehensive studies 
could be undertaken in the future. In this 
paper the field of attitudes of industrial man- 
agers toward organized labor is explored. The 
problem of the study is twofold: (a) to assess 
the kinds of attitudes industrial managers in 
the South hold toward organized labor, and 
(5) to assess the extent to which southern 
managerial attitudes differ from those held 
by managers in other regions. The strategy 
employed to accomplish these tasks was to 
develop a Guttman-type scale of attitudes of 
industrial managers in a southern city and 
then to compare the findings with available 
scales and other indexes of attitudes held by 
management outside the South. Since many of 
the instruments used in other studies con- 
tained items referring to local situations, they 
were not suitable for research in the South. 
For this reason, an attempt was made to de- 
velop a scale which would yield the desired 
data on southern managerial attitudes while 
at the same time being suitable for regional 
comparisons. Given the importance of this 
methodological problem, the technique used 
in scaling and the technical characteristics of 
the scale will be described in some detail. 
The data were gathered from five bulk raw- 
material processing plants located in an ad- 
ministrative-industrial center in the Southern 
United States with a metropolitan area popu- 
lation of about 275,000. These corporations 
have some form of labor contract with either 
an independent labor union or a union afili- 
ated with the AFL-CIO. Seventy lower-level 
officials and 20 top management executives 
employed by these corporations were inter- 
viewed. Although separate interview schedules 
for each of the two levels of management were 
Prepared and used throughout the study, 
items making up the scale presented here were 
identical for the two managerial levels. 


ATTITUDES TOWARD LABOR UNIONS 
Items of the Scale 


Attitudes of management toward labor 
unions were measured by seven items, each 


— lour- response categories, These 
were assigned weights ranging from 
Ü for strongly favorable to 3 for strongly un- 
favorable. The items are as follows: 


of Labor unions not only better thelr members’ 
C economic and social conditions but ake com- 
tribute to the welfare of society. 
. The drive to organize workers is a legitimate 
_ fight for labor unions, 
3. Collective bargaining is a necessary and im- 
portant function of labor unions that should be 


preserved. 
4, Labor unions are reasonable in their drive to 
= recruit workers into their organizations, 

5. Labor unions’ demands regarding wages, hours 
ol work, and working conditions, are in most 
cases, fair and reasonable. 

8 Labor’s demand for the union shop is a legiti- 
1 mate richt for them. 
_ 7. Labor unions should have a share in making 
4 decisions in the corporation. 


Scaling Technique 


-Using the scalogram board technique (Gutt- 
man, 1950), responses of the two groups of 
“managers to the above items were scaled in 
one unit. All items were dichotomized in the 
_ Second trial. The items formed a “true scale” 
a coefficient of reproducibility of .90 
a was obtained, and the errors were randomly 
distributed (Guttman, 1950). The final order 
of the items was as follows: 1, 2, 3, 5, 6, 4, 
and 7. 
The marginal frequencies of the seven items 
are shown in Table 1. Item 7 was retained 
although it failed to meet Guttman’s criterion 
that an item should have 90 to 10% marginal 
` frequencies to be included in a 10-item scale. 


ATTITUDE SCALE or Upper AND LOWER 
MANAGEMENT OFFICIALS 


Marginal frequencies percent 


TABLE 1 
MARGINAL FREQUENCIES OF THE ITEMS OF TRE 
7 


Item* — — —iꝛ 
Disagree Agree 
| 
1 18 82 
2 23 77 
3 52 48 
5 58 42 
6 61 39 
4 72 28 
7 93 7 


. ͤ ee 
* Items are listed according to their final order in the scale, 
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TABLE 2 


Deruecnor or Lowes wo Urere Maxioruew? 
Orretara w ree Artereees Sous Tyres 


Perfect Swale iteme * 
scale — 71 — r — — 
type | a 9 Na} % 

— A ES E = Seen 

Traore Ss 2s a ey eee 
n z= = 22 „* a] 82) 1333 
m ee eee SM o” 
IV oe o 
y ECR S 4 au“ 
VI wee 8 SSS 
vu z| 232| mes 
Total 2% | 100 


It was decided that marginal frequencies of 
93 to 7% are close enough to the minimum 
requirement which is, in itself, an arbitrary 
limit. 

Scores in the scale ranged from 0 to 7. Only 
one respondent scored 0 and one other respon- 
dent scored 7. Both respondents were lower 
managerial officials. Upper management offi- 
cials contributed fewer than their pro rata 
errors (11% of the total number of errors in 
the scale). Their scores, however, were dis- 
tributed in a consistent pattern. 

Seven scale types were established with 
respondents ranging from 12% in the first 
scale type (classified as having the strongest 
negative attitudes toward labor unions) to 3% 
in the seventh type (considered to have the 
strongest prounion attitudes).* Table 2 shows 
the number of respondents in each scale type. 

The theoretical framework of the scale in- 
cludes the assumption that matters of agree- 
ment and disagreement between labor and 
management can be used to identify manage- 
ment feelings toward labor unions. Matters 
like the role of unionism in industrial society, 
the demands made by unions to improve 
workers’ social and economic conditions, the 
significance of collective bargaining, labor’s 


3Stressing negative rather than positive attitudes 
toward organized labor is intentional since the paper 
is derived from a study of a set of hypotheses deal- 
ing with the impact of work orientations of man- 
agers upon their negative attitudes toward labor 


unions, 
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demand for the union shop, and labor partici- 
pation in making decisions in the corporation 
were considered a valid basis for assessing 
attitudes of both upper and lower manage- 
ment personnel toward labor unions. 

Polarization of attitudes and the likelihood 
of an attitudinal continuum is supported by 
the fact that the majority of the respondents 
agreed with the statement that labor unions 
have a vital role in the growth and welfare 
of modern society (Item 1), and disagreed 
with the statement that unions should have 
a share in making decisions in the corporation 
(Item 7). Between these two extremes, re- 
sponses fell into a gradient pattern with the 
number of disagreements per item decreasing 
gradually toward Item 1 and increasing 
toward Item 7. 

The final order of items in the scale reveals 
that respondents considered the existence of 
labor unions to be important to society. They 
also agreed that union efforts to recruit mem- 
bers are legitimate. On the issue of collective 
bargaining, however, responses leaned more 
to the negative side of the scale. Relatively 
high disagreement did not begin until re- 
spondents were asked to express their feelings 
toward the union shop. Disagreement became 
greatest (most widespread) when opinions 
were expressed about the legitimacy of labor 
to have a share in making decisions in the 
corporation.* 

Judging respondents on the basis of their 
scores in the scale, four levels of attitudes 
were established: high negative, moderately 
high negative, moderately low negative, and 
low negative. The distribution of respondents 
in these categories is shown in Table 3. The 
table shows that the majority of respondents 
are concentrated in the two middle categories 
and that the two extremes have almost equal 
numbers of cases. The authors are disposed 
to conclude that southern managers have mod- 
erately negative attitudes toward organized 
labor. 

The distribution of negative attitudes 
among managers of the five corporations is 
shown in Table 4. More than two-thirds of 


Using Schuessler's Test I, it is concluded that the 
observed frequencies of the scale types exceed the 
frequencies expected by chance. See Schuessler (1961) 
for fuller details. 
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TABLE 3 


CLASSIFICATION OF MANAGEMENT BY ATTITUDES 
TOWARD LABOR UNIONS 


No. 90 


Attitude Score | 
High negative 7,6 13 14.44 
Moderately high negative} 5,4 38 42.22 
Moderately low negative 3,2 27 30.00 
Low negative 1,0 12 13.33 
Total 90 100.00 


the managers in Corporation A show low-nega- 
tive attitudes. On the other hand, Corpora- 
tions B, C, D, and E are characterized by 
high-negative attitudes with varying degrees 
of intensity. The striking thing about this 
distribution is the relationship it bears to size 
of the corporation. Specifically, the largest 
corporation has the least negative views while 
the “medium-sized” and small corporations 
have the strongest negative views. While this 
finding may be an artifact of the specific 
union-management relationships in the vari- 
ous organizations, the authors were unable to 
find evidence to support this view. They are 
led to conclude, then, that factors associated 
with size of the organization influence man- 
agerial attitudes toward unions. Specifically, 
the less negative views found in the largest 
corporation may reflect the fact that labor- 
management relations have reached a highly 
stabilized pattern based upon common goals 


TABLE 4 


DISTRIBUTION or LEVELS or NEGATIVE ATTITUDES 
IN THE Five CORPORATIONS 


Attitudes toward 
labor unions 
~ _| Approx. 
i High Low Total 
employees] negative negative 
No. % No. % 
A 2200 6 26.00 17 73.91 23 
B 720 18 72.00 7 28.00 25 
Cc 700 | 12 | 66.66] 6 33.33 18 
D 125 9 56.25 7 43.75 16 
E 100 6 75.00 2 285.00 8 
Total st 39 90 
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and mutual interests, and that leadership of 
both sides has developed specialized forms of 
communication to channel differences between 
them. It is also possible that unions constitute 
less of a threat to managers of large organi- 
zations than is true in smaller corporations. 
On the other hand, it seems likely that the 
“paternalistic” pattern of labor relations dis- 
cussed by Heberle and the subsequent de- 
velopment of “pseudo paternalism” and “co- 
ercion” suggested by Roy, is more likely to 
be a strategy employed by the middle-sized 
and smaller corporations, If this is indeed the 
case, then the strong negative views found in 
the middle-sized and small corporations may 
reflect the managerial reaction to the strong 
conflict provoked by labor-management dis- 
agreements in such a system. 


Attitudes and Organizational Commitment 


Previous studies have examined the corre- 
lation between attitudes toward labor unions 
and liking or disliking for the company in 
order to demonstrate the existence of what 
has been called a “common attitudinal cli- 
mate” among industrial workers and man- 
agers. Chalmers and his associates found that 
where industrial workers showed a liking for 
management, they expressed positive feelings 
toward unions and vice versa (Chalmers et al., 
1953-54). On the other hand, tests of such 
relationships among industrial managers 
showed an absence of significant association 
(Stagner et al., 1958). 

In the current study attempts were made to 
test the association between attitudes of lower- 
level managers toward labor unions and levels 
of commitment to the employing corporation. 
Organizational commitment was measured by 
responses to seven items dealing with feelings 
of managers regarding their work in the com- 
pany, with their willingness to help the com- 
pany during a crisis, and with their sense of 
identification with the corporation. The scalo- 
gram board technique was used to rank these 
items. In the final trial only five items were 
scalable with a coefficient of reproducibility of 
90. Because the number of items was small 
and the requirements of a “true scale” were 
not fulfilled, details of the scaling procedures 
are not reported here. 
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The correlation between negative attitudes 
toward labor unions and organizational com- 
mitment indicates that the relationship is not 
significant.* Obviously this finding means that 
respondents who have high negative attitudes 
toward labor unions are not committed to 
their organization. While this may be correct, 
the authors must point out that their measure 
of organizational commitment may have influ- 
enced the outcome of the test. On the other 
hand, they take refuge in the finding of the 
Illini City study which found a “common 
attitudinal climate, and conclude that man- 
agers who have strong negative attitudes 
toward labor tend to lack strong commitment 
to the organization. 


Discussion 


How do the attitudes of southern industrial 
managers toward labor unions compare with 
the views held by those outside the South? 
Here it is instructive to compare the findings 
of the results of two items of the scale with 
findings of a Gallup poll made public at al- 
most the same time the data of the study were 
collected (Gallup, 1965). The poll asked 2 
national sample: “In general do you approve 
or disapprove of labor unions?” In the Gallup 
sample 23% of the nonunion respondents dis- 
approved of labor unions. In the current study 
of managers 18% disagreed with the state- 
ment: labor unions “not only better their 
members’ economic and social conditions but 
also contribute to the welfare of society.” 

A second question in the Gallup poll asked: 
“Do you think a person should or should not 
be required to join a union if he works in a 
unionized factory or business?” Of the non- 
union respondents in the poll, 56% replied 
negatively. The present sample of southern 
managers elicited 61% negative replies to 
Item 6: “Labor’s demand for the union shop 
is a legitimate right for them.” 

It should be noted that the differences in 
percentages between southern managers and 
nonunion Gallup poll respondents for both 
items is rather small. In both instances the 
95% confidence interval around the southern 
managerial sample proportions includes the 
proportion from the Gallup poll. Although 


5 r= 035; p(z = —417) > 05. 
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the specific wording of the items is somewhat 
different, in both cases the meanings are simi- 
lar. Evidently southern managers do not dis- 
agree greatly with Gallup’s nonunion respon- 
dents. 

This should not be surprising, nor does it 
necessarily contradict Stouffer’s findings that 
Southerners are less tolerant than the rest of 
the nation. Stouffer (1955) found that the 
amount of schooling and type of community 
(rural-urban) of his cross-section respondents 
influenced their level of tolerance.“ Thus, the 
higher the educational level the more likely 
the individual is to show tolerance of non- 
conformists. Similarly, urban respondents 
tended to score relatively higher on the toler- 
ance scale than rural respondents. In the cur- 
rent study, respondents are urban residents 
and have achieved higher levels of education 
than the average Southerner. Thus, the rela- 
tively high proportion of southern industrial 
managers approving of labor unions may re- 
flect a tolerance that is due to their educa- 
tional achievement and urban residency," 

Viewed from a different perspective, the 
relatively mild reaction of southern manage- 
ment to the existence of labor unions, espe- 
cially in large corporations, can be attributed 
to the weakness of unionism in the South. It 
was pointed out earlier that southern labor 
unions have not reached the strength achieved 
by unions outside the southern region.® 

The findings of the present study are most 
meaningful when compared to similar studies 
of managerial attitudes outside the South. The 
most comparable study is one in which Stag- 
ner et al. (1958) developed an 8-item Gutt- 
man scale to measure attitudes of managers 
to labor unions and a 9-item scale of union 
attitudes toward management. The authors 
found that in most organizations “manage- 


€ By tolerance Stouffer means “willingness to grant 
certain rights to people whose views might be dis- 
approved, such as the right to speak, the right to 
hold certain kinds of jobs, etc.” See Stouffer (1955) 
for details, 

The educational factor should probably be given 
more weight since urbanism by itself, as Stouffer 
points out, did not account for a significantly high 
level of tolerance in the southern region. 

»The South is still considered by one union or- 
ganizer of being the “new frontier? for organized 
labor. See Conney (1961). 
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ment representatives were either moderately 
or very favorable toward the union” (Derber 
et al., 1960). They also found that managers 
were favorably disposed toward unions when 
asked to respond to more specific issues such 
as the role of the union in influencing the 
manner in which the organization was run, 
whether or not union claims were considered 
reasonable, and whether or not the amount of 
power enjoyed by union representatives was 
appropriate. 

The findings of the Stagner et al. study 
differ from those reported in this study in sev- 
eral respects. First, in contrast to the generally 
favorable attitudes of the Northern managers, 
the Southern managers had a generally nega- 
tive attitude toward organized labor. Second, 
even more pronounced differences exist be- 
tween the two groups when the specific items 
of the two studies are found. While responses 
to the item dealing with contract negotiations 
“tended to be concentrated at the more fa- 
vorable or friendly end of the scale” in the 
Stagner et al. study, responses of the South- 
ern managers to the question dealing with 
collective bargaining clearly lean toward the 
negative side of the scale. (See Table 1.) To 
the question of labor unions’ power and their 
interference in areas of production and dis- 
cipline the general pattern of responses re- 
ported in the Stagner et al. study is favorable 
although “extensive management criticism 
was apparent in many establishments.” The 
current study, on the other hand, shows a 
rather overwhelming majority of negative re. 
sponses to the question dealing with the legiti- 
macy of labor’s demands for a union shop 
and “a share in making decisions in the cor- 
poration.” 9 


SUMMARY AND CONCLUSIONS 


In the absence of an instrument for mea- 
suring attitudes of southern managers toward 
unions, this paper has attempted to develop 


3 

Scale items in the Stagner et al. study have å 
great deal of resemblance to the corresponding items 
of the current study. The basic difference is that 
questions in Stagner’s study ask reaction of manager 
to relationships between themselves and the union 
which exist in actuality whereas items of the cul 
rent study are simple declarative statements overly 
implying neither normative nor personal preference 


MANAGERIAL ÅTTITUDES Towarp Lapor Untons 


a Guttman-type scale for industrial managers 
in a southern city. The instrument developed 
met the requirements of a “true scale” as 
specified by Guttman. Perhaps one reason for 
the success of this scale is that the items pre- 
sented to the respondents were generalized 
statements. This confirms the Stagner et al. 
conclusion that generalized items tend to scale 
better than specific items, The scalability of 
the items used in this study leads the authors 
to conclude, with the Stagner et al. study, 
that a Guttman scale represents a highly re- 
liable way to ask managers “do you approve 
of the union?” (Stagner et al., 1958). 

A comparison of the characteristics of the 
scale developed on southern managers with 
those developed on populations outside the 
South revealed a number of important dif- 
ferences. It is probably correct to conclude 
that most of the differences arose when ques- 
tions focused on specific issues. The instru- 
ment itself seems to be more applicable to 
small corporations in areas not highly indus- 
trialized, and in regions where unionization 
has not dominated the modes of grouping 
employees. 

A final word is in order. The findings of 
this study must be qualified by one limita- 
tion. That is, some of our “southern” man- 
agers, especially top executives, may not be 
southern in origin. It was pointed out that 
absentee ownership has become an important 
factor in shaping labor-management relations 
in the South. Given the fact that most of the 
corporations included in this study are ab- 
sentee owned, and that the rate of geographi- 
cal mobility of industrial executives is high, 
it is possible that some of our findings may 
reflect the influence of nonsouthern managers. 
In fact, the major cleavage in southern indus- 
trial organization may involve a difference be- 
tween top and middle management with re- 
gard to their views on organized labor which 
is a function of the distinction between large 
absentee-owned corporations and smaller in- 
digenous organizations. 
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A study was designed to determine whether the effectiveness of programmed 


material presented by teaching 


machine and programmed text varies with 


Kuder interest pattern. Specifically, 30 introductory psychology students with 
high mechanical and low literary interest and 30 low mechanical and high 


literary interest students learned 2 


Programs. 4 of each group learned the 


Programs via teaching machine while the Temaining halves were instructed by 
Programmed text, While no interest patterns or mode of presentation effects 
were uncovered, a significant Interests X Mode interaction appeared for the 
longer of the 2 programs. The implications of the results for educational and 


industrial training Programs are discussed, 


Studies which have compared programmed 
texts and teaching machines as modes of pre- 
senting programmed material generally find 
few differences in acquisition and retention 
differentiating the two approaches (e.g., 
Eigen, 1962; Goldberg, Dawson, & Barrett, 
1964; McKeown, 1965. Also cf. Goldstein, 
1964; Goldstein & Gotkin, 1963). However, 
it is possible that, while the two modes of 
presentation might be comparable with groups 
of unselected Ss, individuals might respond 
differentially to the two procedures as a func- 
tion of personality/interest pattern differ- 
ences. In fact, a related suggestion has been 
made by Ripple, OReilly, Wightman, and 
Dacey (1965), who feel that personality char- 
acteristics of the learner may differentially af- 
fect learning under programmed instruction 
and conventional teaching regimens, 

A particularly intriguing hypothesis sug- 
gests that Ss may learn differentially from 
teaching machines and programmed texts as 
a function of interest differences. For ex- 
ample, Ss with high mechanical interest might 
be expected to learn better from teaching 
machines than persons with high literary in- 
terest, while the latter group might “outlearn” 
the former on material presented by pro- 


*The authors are indebted to Vi Kohorst and 
Kate Tidd for their assistance in the Preparation of 
this paper. 


grammed text. The present study was de- 
signed to determine if interest patterns reflect- 
ing literary or mechanical preferences are 
related to the differential effectiveness of text- 
and machine-presented programmed materials. 


METHOD 
Subjects 


The Ss were selected from 93 male volunteers 
drawn from two introductory psychology courses at 
St. Cloud State College. All of the students were 
administered the Kuder Preference Record, Fom 
CH. On the basis of these scores, two groups were 
formed: Group Lui-Mi. (above the total sample 
median on the Literary scale and below the 191 ö 
sample median on the Mechanical scale) an 
Group Mur-Lie (below the total sample median on 
the Literary scale and above the total sample 1 
on the Mechanical scale). Those Ss who did me 
conform to these combinations of scores were elimi- 
nated from the experiment at this point. 
because the statistical design employed required ce 
to-cell proportionality, some additional Ss were m 
domly eliminated and the study was run with 30 Mar 
Lie and 30 Lur-Mie Ss, half of each sample = 
assigned to a teaching machine group and half m 
programmed text group. Minnesota Scholastic 15 
tude Text scores were available on 51 of the 60 1. 
Permitting a rough evaluation of the intellectual 15 
ities of the four resulting samples, An F test run b 
tween the means of the four samples proved a 
nonsignificant (F 200, df=3/47; M = 34.0, 
= 14.1); it appears likely, therefore, that intellectu 
differences did not materially affect the results. 


e 
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PROGRAMMED INSTRUCTION AND MEASURED INTERESTS 


Programs 


Two short linear programs were prepared es- 
pecially for the project.* One, arranged in a fading 
format and consisting of 19 frames, was designed to 
lead to memorization of the following fictitious busi- 
ness address: 


Mr. Karl G. Lunden 

Chief of Product Development 
314 Brewster Building 

101 Hollyhock Drive 

Mesa, California 


The second program was intended to teach the prin- 
ciple underlying the digit- letter sequence: 


ND 


This program consisted of 21 frames with over- 
lapping stimuli. Both programs required constructed 
responses. The machine presentation was imple- 
mented by means of a MTA-100 Scholar teaching 
machine. The special booklets prepared for the text 
presentation were arranged in a horizontal format. 


Procedure 


The 30 Ss each in the D-Mie and the Myi-Li 
groups were assigned randomly to either a machine 
or a text treatment condition. Upon arrival at the 
experimental room each S was exposed to printed in- 
structions regarding the use of the relevant presenta- 
tion mode. These instructions were comparable ex- 
cept for necessary references to either the machine 
or the text. While the machine program was ad- 
ministered individually, the booklet was adminis- 
tered to students in groups of about 15. The Ss were 
given all the time needed to complete the experi- 
mental programs assigned to them. (The average 
time required for the program was about 15-20 
min.) After completing the program Ss were excused 
With the explanation that they would be called back 
at the same time the following day. No description 
of the content of the second session was offered at 
this point. The next day Ss were examined for re- 
tention of the programmed material. This involved 
instructions to reproduce the business address and 
the digit-letter series learned on the preceding day. 
The dependent variable used for analysis was the 
number of individual characters correctly recalled. 
Thus, in the business address there was a total of 
88 letters and numbers, and a perfect score neces- 
Sitated the recall of all characters in the correct 
order. Correspondingly, a maximum of nine charac- 
ters could be recalled in the digit- letter sequence. 


RESULTS 
Business Address Retention 


The means of the total number of business 


address characters recalled after 24 hr. are 
— eet 


* Copies of the programs are available from the 
Senior author upon request. 
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TABLE 1 


Means ror Tota, Nun or Business Apparss AND 
Diatx-Lxrrxa Cuaracters RECALLED AFTER 


24 He. 
lll 
Group Machine | Text | Total 

Lar-My* 

Address 50.00 62.80 56.40 

Digit-letter 7.28 6.67 698 
MarL,," 

Address 66.74 51.07 58.90 

Digit-letter 7.87 7.87 7.87 
Total 

Address 58.37 56.94 57.66 

Digit-letter 7.58 7.27 7.42 


Note.— is for each cell. 

* Above the total sample median on the Literary scale and 
below the total sample median on the Mechanical scale. 

> Below the total sample median on the Literary scale and 
above the total sample median on the Mechanical scale. 


summarized in Table 1. Differences among 
the means of the rows and columns were 
tested for significance at the Os level by a 
two-day analysis of variance. Table 2 sum- 
marizes this analysis. 

The implications of these data are as fol- 
lows: 


1. It is obvious that for the combined sam- 
ples there is no significant difference between 
the effectiveness of the programmed text and 
the teaching machine. 

2. When data from the programmed text 
and the teaching machine are combined there 
is no significant difference between the per- 
formances of the two interest groups. 

3. A significant interaction is observed be- 
tween interest pattern and mode of presenta- 
tion. By comparison with the programmed 
texts, the teaching machine appears to be a 
more appropriate technique for those with 
high mechanical and low literary interests 


TABLE 2 
ANALYSIS OF BUSINESS ADDRESS RETENTION 


Source SS af MS F 
Machine-Text 30.8 1 30.8 | <1.00 
Group 93.7 1 93.7 | <1.00 
Interaction 3038.8 1 3038.8 4.02* 
Within cells 42294.3 56 755.26 

Total 45457.6 59 
*p<,05. 
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while the opposite is true for those with low 
mechanical and high literary interests. 


Digit-Letter Retention 


The means for the four samples on the 
digit-letter task are also summarized in 
Table 1. 

The digit-letter retention scores were 
analyzed in the same manner as the business 
address data in Table 1. However, as neither 
the differences among marginal means nor the 
Row X Column interaction approached sig- 
nificance, the analysis is not described in a 
separate table. The Fs for interest pattern, 
mode of presentation, and interaction were 
1.50, .17, and .16, respectively, all d/s = 
1/56. 


Discussion 


It is evident that with at least some pro- 
grams (e.g., the business address program) a 
significant interaction exists between interest 
pattern and mode of presentation. 

A sophisticated theoretical analysis is not 
needed to account for the differences observed 
between the Lur-Mio and the Mur-Lio groups. 
A student who “likes” things mechanical and 
“dislikes” printed materials might be expected 
to respond more favorably to the teaching 
machine on motivational grounds. Conversely, 
an S who “likes” books but “dislikes” ma- 
chines might be expected to exhibit an oppo- 
site effect relative to the two presentation 
modes. 

The results may have particularly broad 
implications for future development in the 
programmed instruction field. They suggest 
that training programs for groups with high 
mechanical interests (presumably factory 
workers, tradesmen, etc.) might best be 
structured for machine presentation while 
those designed for persons with stronger liter- 
ary interests (e.g., students, clerical workers) 
are probably best presented as programmed 
texts. Additional research on the relationships 
between interests and differential effectiveness 
of various types of programs seems to be very 
much in order. 

As has been noted, no interaction effect 
was observed with the digit-letter program. 
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This variable influence of interest patterns 
upon retention may be referable to specific 
differences in the two programs. First, a fun- 
damental difference in format is to be noted. 
The business address program employed a 
fading format in which the general instruc- 
tional goal is memorization, The digit. letter 
program, on the other hand, was designed to 
teach a particular principle from which the 
construction of a particular series of charac- 
ters could follow. 

Second, the difference in the number of 
characters in the two programs is to be noted. 
While a maximum of 88 correct responses 
were involved in the retention of the business 
address, the digit-letter series included only 9 


characters. Thus, the variability in recall was 


restricted in the latter program due to a much 
lower maximum “ceiling” of retention. 

In any event it would appear that with 
some programmed materials arranged in cer- 
tain formats, retention varies as a function of 
particular patterns of measured interests. The 
more specific nature of this interaction awaits 
further analysis as does the relationship be- 
tween interest and personality variables, and 
criteria other than retention. 
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This research studied the extent of individual need fulfillment of career United 
States Army officers at 3 different levels in the organizational hierarchy. The 
relative importance which the officers attach to the needs was also investigated 
A sample of 504 officers in the ranks of captain through colonel responded to a 


mailed questionnaire, Conclusions are: 


fillment tend to increase at successi 


(a) Perceived deficiencies in need ful- 
ively lower levels in the organizational 


hierarchy, (b) In general, need-fulfillment deficiencies are greatest for the needs 
in the ego and self-actualization categories. (c) The deficiencies which appear 
to have the most deleterious effect on the accomplishment of Army goals are 
those associated with the opportunity to exercise authority and the comparative 
amount of recognition received in command and staff positions. 


Like many industrial organizations, the 
United States Army devotes considerable time 
and effort to the problem of recruitment and 
retention of qualified managerial (officer) 
personnel. Another problem of equal im- 
portance, which has received perhaps less at- 
tention, is that of obtaining maximum effort 
and value from those officer personnel who 
have committed themselves to a military 
career. Myers (1964) and others provide 
evidence to suggest that certain support fac- 
tors such as good working conditions, fringe 
benefits, and fair practices may reduce dis- 
satisfaction to the extent that individuals will 
Stay with the organization, but that a differ- 
ent group of factors causes managers to be 
highly motivated toward their work. Myers 
believes that this latter group, which he calls 
“motivators,” includes the factors of achieve- 
ment, growth, responsibility, and recognition. 

McGregor (1957) interprets this same mo- 
tivational problem in terms of the hierarchy 
of needs developed by Maslow (1954). He 
feels that organizations continue to concern 
themselves with satisfying the physiological 
and safety needs of their employees through 
the provision of better wages, better working 
conditions, and more fringe benefits even 
after these two lower level needs have been 
quite well satisfied. The greatest mistake, Mc- 
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Gregor believes, stems from management's 
failure to recognize that employee dissatisfac- 
tion and failure to put forth more than mini- 
mum effort derive from the lack of opportuni- 
ties to satisfy their social, ego, and self-ful- 
fillment needs. The needs contained within 
these three higher level categories, particu- 
larly the ego and self-fulfillment needs, closely 
parallel Myers’ “motivators,” 

In a series of studies of industrial manage- 
ment personnel, Porter (1961, 1962, 1963a, 
1963b, 1964) assessed the extent of per- 
ceived deficiencies in need fulfillment as a 
function of job level and of line versus staff 
type of job. Using questionnaire items classi- 
fiable into a Maslow-type need hierarchy sys- 
tem he found that the largest deficiencies 
existed in the two highest-order need cate- 
gories and that deficiencies tended to in- 
crease at each successively lower level of the 
management hierarchy. He also found that 
line managers perceived greater need fulfill- 
ment than staff managers but did not differ in 
regard to the importance they attached to 


each type need. 
PURPOSE 


The purpose of the present study was to 
examine the extent of individual need fulfill- 
ment of regular United States Army officers 
at three different levels in the organizational 
hierarchy. Deficiencies in need fulfillment 
were assessed by determining to what extent 
an incentive is provided for each need versus 
the extent to which an incentive should be 
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provided for cach need, The discrepancy be- 
tween what is provided versus what should 
be provided is an indirect measure of the sat- 
isfaction or dissatisfaction with each need. An 
investigation was also made of the relative 
importance which the officers attach to the 
needs used in the study. 


Mernop 
Subjects 


officers in the three “combat” branches, 
Armor, Artillery, and Infantry. Officers in these 
branches experience a similar and reasonably clear- 


Using Maslow's need hierarchy as a guide, the 
needs shown in Table 2 were selected for investi- 
gation. 

Interest in the extent of perceived 
need fulfillment prompted the use 
constructed 


TABLE 1 
Orricxxs SAMPLED BY RANK CLASS 


Lt. Colonels/ 
Colonels 


Note.—N =504, 
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TABLE 2 


Srzcinic Nexps sy Nero Catecory Ixcompey 
INTO QUESTIONNAIRE 


— SP lee 


Safety Financial security 
Social Association and lasting fri 
with high-caliber persons 
Ego 
Self-esteem Authority 
Independent thought and 
Reputation Prestige (from people outside 
military) y 
Recognition (credit for 
ment) in command position 
Recognition (credit for accom 
ment) in staff positions 
Self-actualization | Realizing full potential 


Feeling of worthwhile accom 
ment 


thought and action, prestige, realizing full pot 
and a feeling of worthwhile accomplishm 
first question for each of these needs, the 
the officers to compare need fulfillment opp 
in the Army to other careers with responses 
point Likert scale of less to more. The sec 
tion for each need was similarly constructed ex 
for dropping off the lowest response (less) and. 
ing a response to the upper end (much mor 
was done in an effort to obtain a more 
distribution of responses to the second q 
it was unlikely that a respondent 
that he should have less opportunity to 
Particular need than he is now being 
identical scales for both questions would f 
force the answers to the second question into 
upper end of the scale. The second category of 
consisted of those which had an internal orie, 
and did not lend themselyes to a comp 
proach. These needs included those for autho! 
for recognition in command and staff 
Questions concerning these needs were 
terms of how much recognition do you and $ 
you receive and how much authority do 
should you get to exercise, considering the a 
responsibility associated with the job. 
tated the use of a low-high scale with resp 
the first question for each need of low, si i 
low, neither especially high nor low, somewhat 
and high. As with the first category of ques 
responses to the second question were altel 
dropping off the lowest response (low) and 
a response to the upper end (very high). An 
ended question was used to solicit any commen 
Ss desired to make regarding their responses t 
two questions pertaining to each need. Fi all 
Were asked to rank in order of importance to 
the specific needs under study. The questionm 
was pretested on 20 locally available officers. 
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RzsuLTš TABLE J 
Perceived need-fulfillment deficiencies were e Durresesee Sows sy Rasen Crass on a 
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1 
H 
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ol dissatisfaction among the various needs 


T 
i 
7 
i} 

8 ijs 


4. 
; 
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which is accorded their rank by 
side the military while the majors and 


T 
75 
28 
3 
is 
ald 


ES 


majors. On the other hand, the highest 
in the group is that of the colonels’ 
authority. With six of the 
captains’ score is higher than 
and the majors’ score is 
colonels’ score. This suggests, of 
need-fulfillment opportunities are 


Hg 
Hi 
23 


H 
ig 
H 
f 
| 
EPR 
21 


TABLE 4 
Sto — , ] Älu Denctexcins ix 
IGNIFICANT Nis 


Financial security 

Association and lasting friendships with 
high-caliber persons 

Authority 

dent thought and action 


Recognition (command positions) 
Recognition (staff positions) 

Realize full potential 

Feeling of worthwhile accomplishment 


*p<.05. 
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some parametric statistic wherein the ability 
to meet the assumptions inherent in such a 
statistic would be questionable, the Kruskal- 
Wallis II test was used. Hays (1963) believes 
that this is a good rank-order method of deter- 
mining whether a group of independent sam- 
ples comes from the same or different popula- 
tions and states that it compares very well 
with the F test from the analysis of variance. 
Hays (1963) points out that once the H has 
been computed, a correction formula should 
be applied in any case where the number of 
Ss is small or where there is a large number 
of tied observations. Since there was a sub- 
stantial number of tied observations in the 
distributions of difference scores, this correc- 
tion is applicable here. The correction serves 
to increase the size of the H, so a correction 
was applied when the matter of significance 
was in question. 

Table 4 shows the results of these computa- 
tions. An overall H was computed for each 
need and, where the H was significant, three 
additional H tests were computed to deter- 
mine the difference between any two rank 
classes. It can be seen that, for the four needs 
where the overall H tests are significant, in 
each case there are significant differences be- 
tween the captains and majors and the cap- 
tains and colonels but not between the majors 
and colonels, 

The results of the investigation of the 
relative importance which Ss attach to the 
needs used in the study are contained in the 
following hierarchy of needs going from most 
important to least important: 


1. Feeling of worthwhile accomplishment. 

2. Opportunity to realize full potential. 

3. Opportunity for independent thought 
and action. 

4. Financial security. 

5. Opportunity to associate with high- 
caliber persons, 

6. Recognition. 

7. Opportunity to exercise authority, 

8. Prestige. 


No differentiation was made between recog- 
nition in command and staff positions. Based 
on an examination of the frequency distribu- 
tion of responses for each need, the median 
response was selected as the most meaningful 
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index of central tendency for the Purpose of 
determining the rank order. For si i 
only the rank order based on the entire sam. 
ple is presented. When the rank classes were 
taken separately, there were slight and gener- 
ally insignificant differences in the rank orde 
for majors and colonels. The greatest com 
sensus among all Ss existed for the two most 
important and two least important needs. 


DISCUSSION 


The Army, like the industrial organiza- 
tions studied by Porter, has apparently cre 
ated an environment in which the opportuni- 
ties for higher level need fulfillment are 
greater at successively higher levels in the 
organizational hierarchy. Such an environ 
ment is conducive to continued growth and 
development of managerial (officer) person- 
nel, and it would appear that guidance of 
junior officers should stress the ever-increas 
ing opportunity for higher level need satis- 
faction as they progress through their careers. 
On the other hand, need-fulfillment opportuni- 
ties at the lower managerial levels must be 
sufficient to prevent widespread dissatisfaction 
at these levels. 

Some interesting implications emerge when 
the obtained need-fulfillment deficiencies arè 
considered in conjunction with the relative 
importance which Ss attached to the needs. 
Based on Maslow’s hierarchy it would be pre- 
dicted that needs from the ego and self-actu- 
alization categories would be the most im- 
portant and perceived as the least fulfilled. 
This prediction generally holds up for the 
three needs perceived as most important but 
is inconsistent with the results pertaining t0 
the needs for prestige and for the opportunity 
to exercise authority. Not only are these n 
rated in the last and next to last position by 
all ranks, but also the officers perceive rela- 
tively large need-fulfillment deficiencies for 
these needs. This raises the question of 
whether these needs are truly unimportant to 
the officers studied, relative to the other needs, 
or whether the extent of perceived need-ful- 
fillment deficiencies is sufficiently large t0 
have caused the officers to downgrade the 
importance of these needs because they see n0 
hope of satisfying them. 

Regardless of the reason for the need for 
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magnitude. om Army, of necessity, must 
great importance on the development 
of commanders. A Department of the Army 
(1964) circular states that “command and su- 
pervisory positions enjoy the highest prestige 
of all positions in the Army, and there- 


_ fore are the most sought after duty assign- 


ments [p. 81. The low ranking and the dis- 


dissatisfaction with this need. Not only is it 
@ reversal of trend for the colonels to express 
more dissatisfaction with this need than the 
two lower ranks, but their score is the highest 
of all dissatisfaction scores in the study. The 
questions on authority elicited a number of 


open · ended responses which may help to ex- 


plain these results. The following quote from 
one lieutenant colonel expresses the general 
tenor of these responses: 


One factor, more than any other, promises to 
affect adversely motivation. That is, the trend 
toward highly centralized control and management 
of all military endeavor. This trend not only will 
— reduce the aggressive initiative of 
already in the Army but also will reduce the 
of younger men of high caliber who might 
Otherwise have been attracted to the challenges of 
military life. At its worst, the trend could produce a 
Mediocre Army of conformists, input data statisti- 
dans, and escapists looking for immunity from the 
Tesponsibilities of decision. 


Though the Army relies heavily on the 
Proper exercise of authority, it would appear 
that the conditions under which such author- 
ity is being exercised are less than favorable. 
A related problem stems from the findings 
relative to the need for recognition or credit 
for accomplishment in command and staff 
Positions, The data show considerably more 
dissatisfaction with the amount of recogni- 
tion received in command positions than in 
staff positions which suggests that the amount 
of recognition being given is not commensur- 
ate with the responsibility associated with 
each type position. This state of affairs can 
only serve to detract from the Army’s empha- 
sis on the attractiveness of command posi- 
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A CORRELATIONAL-CAUSAL ANALYSIS OF THE RELATIO 
SHIP BETWEEN EXPECTANCY ATTITUDES AND 
JOB PERFORMANCE 


EDWARD E. LAWLER, III + 
Depariment of Administrative Sciences, Yale University 
The causal basis for the relationship between expectancy attitudes and job 


performance was tested in this study by the use of 2 correlational approaches to 
determining causality, Data were gathered from 55 managers on the probability 


that certain rewards could 


for the time lags that are involved in expectancy attitudes causing job perform- 
nsidered. 


ance were co! 


A number of recent studies have demon- 
strated that a significant relationship exists 
between employees’ beliefs about the prob- 
ability that rewards depend upon their job 
behavior and their job performance. Lawler 
and Porter (1967a), for example, have found 
that the more managers believe rewards like 
pay, promotion, and respect from others stem 
from good performance the more likely they 
are to be rated as good performers by their 
peers, superiors, and selves. Generally, similar 
findings have been reported in four other 
studies of expectancy, reward. probability, or 
path - goal attitudes as this kind of belief has 
been referred to by different researchers 
(Galbraith & Cummings, 1967; Georgopoulos, 
Mahoney, & Jones, 1957; Porter & Lawler, 
1968; Spitzer, 1964). These studies have 
also tended to show that when expectancy 
attitudes are weighted by the value or im- 
Portance of the rewards considered, the cor- 
relations between the attitudes and job per- 
formance are increased. Lawler and Porter 
(1967a), for example, report a correlation of 
25 between unweighted expectancy-attitude 
items and a measure of performance, but a 
correlation of .38 between performance and 
the same expectancy-attitude items when they 
had been weighted by the value of the rewards 
to the individual managers. 
1The work on this project was done while the 


author was a Ford Foundation Faculty Research 
Fellow. 
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The finding that expectancy attitudes a 
significantly related to performance is, 
course, predictable from the kind of exp 
ancy theories of motivation that have rece 
been stated by Vroom (1964), ins 
(1964), and others. Earlier statements 
expectancy theory by Peak (1955) and 
(1955) also are congruent with these fin 
All these statements of expectancy 
predict that a positive relationship betwet 
expectancy attitudes and performance sh ou! 
exist because expectancy attitudes are r 
flections of the individuals motivation t 
perform. Thus, the expectancy attitudes at 
essentially seen to be causing or lea t 
the performance. Hence, the relationship 
expected not because the performance cause 
the expectancy attitudes or because a th 
variable causes both but because the © 
ancy attitudes cause the performance. 

All of the previous studies of the relati ó 
ship between expectancy attitudes and pe 
formance have used a static correlation 
approach and thus it is impossible to d 
mine the causal basis for the relation: 
found. It typically has been assumed by 
researchers that the relationship found 
about because the attitudes caused the pe 
formance but this assumption has not 
tested. As Porter and Lawler (1968) 
Pointed out, an explanation to the effect 
the attitudes caused the performance all i 
for the most parsimonious explanation of al 
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the data collected and it fits with motivation 
theory. However, before it is safe to assume 
that the causal direction runs 
titudes to the performance, direct tests of the 
causal direction are needed. The present study 
represents an attempt to fill this gap in 
literature by specifically determining 
causal basis for the commonly found relation- 
ship between expectancy attitudes and job 
performance. 

The usual way of determining whether two 
variables are causally related is, of course, 
to do an experiment in which one variable can 
be manipulated in order to observe its effect 
on another. There are, however, certain re- 
search problems that do not lend themselves 
to the experimental model regardless of 
whether a field or laboratory experiment is 
done. Experimental laboratory studies often 
do not seem to offer an adequate enough 
simulation of the real world for results to be 
used with confidence when more applied prob- 
lems are being dealt with, while field experi- 
ments present a number of practical and 
ethical problems. These limitations of the 
experimental approach are certainly present 
when consideration is given to testing for the 
causal relationship between expectancy at- 
titudes and job performance. There are, how- 
ever, two alternatives to the traditional experi- 
mental approach that would seem to offer 
ways of looking at the causal basis for the 
relationship considered in this paper. Both of 
these approaches can be classified as quasi- 
experimental or correlational-causal in nature 
since they allow causal inferences to be made 
with some degree of confidence without the 
usual experimenter (E) manipulation of vari- 
ables or E control over the situation being 
present. 

The first of these is the cross- 
lagged panel correlational method. This ap- 
proach was initially discussed by Simon 
(1954) and has been more recently considered 
by Blalock (1962), Pelz and Andrews (1964), 
and Campbell and Stanley (1963). It in- 
volves gathering similar data on the two 
variables that are to be tested for causality at 
two different points in time. In the case of 
the present problem, this would mean gather- 
ing identical expectancy-attitude data and 
identical job-performance data at two sepa- 


is measured before, after, and at 
time as A, it is possible to determine whether 
the hypothesis A B or B A is more 
tenable. Translating this to the present prob- 
lem it means that, if the expectancy attitudes 


ti tz 


Job Perfo 
Fic. 1. Cross-lagged correlational model. 
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cause the performance, the strongest relation- 
ship should be between the attitude measure 
at Time 1 (ti) and the performance measure 
at Time 2 (tz). 

Vroom (1966) has recently suggested a 
second method that can be used in order to 
determine the causal basis for the relationship 
between two variables. This method, like the 
cross-lagged approach, requires identical mea- 
surements at two points in time. It has the 
advantage of being strong where the cross- 
lagged approach is weak, that is, it can help 
rule out the possibility that a third variable 
is responsible for the relationship found. How- 
ever, unlike the cross-lagged approach it can- 
not tell whether A B or B A. All it can 
tell is something about whether a third vari- 
able has caused both. Thus these two ap- 
proaches are complementary in nature and 
should be used together, 

Dynamic correlations are computed by cor- 
relating the difference between A at ti and tz 
with the difference between B at ti and tə. 
Thus, in effect, changes in one variable are 
being correlated with changes in another 
variable. To produce a spurious dynamic cor- 
relation between two different scores AA and 
AB, a third variable must change in different 
amounts or directions in the members of the 
sample, and these changes must be highly cor- 
related with both AA and AB. Obviously it 
is extremely unlikely that any variable will 
meet these conditions and, because of this, 
most potential third causal variables can be 
ruled out. Thus when a significant dynamic 
correlation is found between two variables, 
one can put more faith in the fact that one 
caused the other than one can when a sig- 
nificant static correlation appears between two 
variables. This is particularly true in situa- 
tions where the researcher can determine that 
many other variables are not changing at all 
or are not changing in a way that would lead 
to their accounting for the significant dynamic 
correlations. 

There is one additional issue that must be 
considered when causal-correlational methods 
such as those described here are used to test 
for causality in situations where the causal 
variable is likely to be continuously changing 
over time. This issue has to do with how much 
time actually elapses between changes in one 
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variable and changes in the other. ‘This issue 
also serves to highlight a difference between 
the two methods. Dynamic correlations would 
seem to be most likely to show Positive results 
when the change in one variable leads almost 
immediately to a change in the other variable, 
that is, where the two variables can, for 
practical purposes, be thought of as moving 
almost together, although causali ty still exists, 
Since this method compares changes in A 
with changes in B between two points in time, 
the changes in A must be quickly reflected in 
B if the changes in the two from ti to te 
are going to be highly correlated. If a long 
time lag exists between changes in A and 
their effects on B then the changes in A be- 
tween ti and tz will not be reflected in the 
changes in B between t, and tə regardless of 
how far ti and ts are separated. They would 
be more likely to appear in the difference be- 
tween B at tz and tz. 

The cross-lagged approach would seem to 
be most likely to show positive results when 
there is some delay between changes in the 
first variable and changes in the second vari- 
able, and when the separation between data- 
collection points approximates the delay be- 
tween changes in one and changes in another. 
If the impact of the change is very rapid, 
then the cross-lagged approach will not show 
Positive results since it depends on Variable 
A measured at ti influencing Variable B mea- 
sured at te more strongly than it influences 
Variable B measured at tı. If the time lag 
for the effect of A on B is very short the 
effect probably will already have taken place 
at ti. And, thus, A will be as highly cor- 
related with B measured at tı as with B 
measured at ta. Considering the specific prob- 
lem in the present study, it would seem that 
there may be considerable delay between 
changes in expectancy attitudes and changes 
in performance. Although an individual might 
start working hard immediately after an at- 
titude change it undoubtedly would take a 
period of time for this actually to show up in 
the relatively crude performance measures 
that are presently available. On the other 
hand, if ti and ts are not separated by 4 
sufficient period for A at ti to affect B at tz 
the cross-lagged analysis will not show pos- 
tive results, In order to allow the cross-lagged 
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ATTITUDES AND Jon PERFORMANCE 


approach sufficient time to show positive re- 
sults, it was decided to separate the two data- 
collection sessions by 1 yr. 


METHOD 
Sample 


Data were collected from 55 managers in social 
service agencies. Included were the directors and all 
the managers of a YMCA, a visiting nurse service, a 
Girl Scout council, and a city department of 
health. Of the 55 managers studied, 41 could be con- 
sidered first-level supervisors while the remaining 
managers were at higher levels of supervision. A 
comparison between the demographic characteristics 
of the sample and the characteristics of other sam- 
ples of middle- and lower-level managers (eg, 
Porter, 1961, 1962) revealed good comparability with 
one exception. Thirty-one of the managers in the 
present sample were females while none of the pre- 
vious samples contained any female managers, Other 
studies (Lawler, 1967; Lawler & Porter, 1967a, 
1967b) have included data gathered from part of 
this sample. 


Attitude Measures 


The attitude data were gathered by means of a 
short questionnaire. One part of the questionnaire 
was designed to measure the managers’ reward-ex- 
pectancy attitudes. It listed six rewards each of 
which was assumed to be related to the satisfaction 
of one or more needs: pay, promotion, prestige, se- 
curity, autonomy, and opportunities to use skills 
and abilities, For each of these six rewards the 
Managers were asked to indicate on a 7-point scale 
how helpful six factors were for obtaining the 
rewards in their organizations (see Lawler & Porter, 
1967a, 1967b, for a description of this questionnaire). 
Two of the factors that were listed were particularly 
concerned with the job performance of the managers 
(working hard and quality of job performance). 
Previous studies (Lawler & Porter, 1967a) have 
shown that these two items tend to correlate highly 
for managers when the managers are asked to indi- 
cate how helpful the items are in obtaining im- 
Portant rewards. . 

A second part of the questionnaire contained a 
section that asked each manager to indicate how im- 
portant the six reward items were to him. For these 
ratings a 7-point scale (1 unimportant to 7 very 
important) was used. In order to develop ex- 
Pectancy-attitude measures for each manager, the 
data from the importance measures were combined 
with the data concerned with the likelihood that 
good performance and working hard would lead to 
the rewards. As suggested by the expectancy theories 
of motivation, this was done by multiplying the im- 
portance of the reward times the degree to which 
it was seen to be obtainable from both good per- 
formance and working hard. The results of these two 
multiplications were then summed for each of the six 
Tewards so that for each manager a composite ex- 
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pectancy-attitude measure was developed of the de- 
gree to which he felt receiving cach of the six 
rewards was based upon his job performance 


Job-Performance Measures 


Each manager was ranked relative to other man- 
agers holding similar jobs by his superior and by 
his peers. Managers were ranked on four factors, 
one of which, the quality of his job performance, is 
relevant for the present study. The rankings were 
converted to standard scores with a mean of SO and 
a standard deviation of 10. Where a manager was 
ranked by more than one peer, an average peers’ 
ranking was computed. In addition to being ranked 
by his superior and peers, each manager was asked 
to rate himself on a number of factors, one of which 
was the quality of his job performance, These rat- 
ings were made on a 7-point scale ranging from 1 
(low in comparison to similar others) to 7 (high in 
comparison to similar others). 

Analysis of the rankings showed fairly good agree- 
ment between the superior and peer rankings (eg, 
r= .56 between peer and superior rankings on job 
performance at tı and r= 4 for ts). It was en- 
couraging to note that superiors’ and peers’ rankings 
correlated most highly when they were ranking the 
managers on identical factors. The correlations be- 
tween superior and peer rankings on the same 
factor also generally tended to be higher than the 
relationships among the superiors’ and peers’ rank- 
ings of the same people on different factors. Thus, 
the superiors’ and the peers’ rankings substantially 
met the requirements of convergent and discriminant 
validity (Campbell & Fiske, 1959). The situation 
with respect to the self ratings was quite different, 
however. The self ratings of job performance showed 
only a low relationship to the peers’ rankings (r= 
.26 at ti) and to the superiors’ rankings (ry =.09 at 
ti). Thus, the self ratings did not show convergent 
and discriminant validity. 


Procedure 


Two meetings, 1 yr. apart, were held with all 
the participants in the study, At both meetings they 
were asked to fill out similar attitude questionnaires 
and ranking forms. It was carefully explained to all 
participants that they were participating in a univer- 
sity-sponsored research project and that all the data 
obtained from them would be held in the strictest 
confidence. The researcher remained in contact with 
the organizations during the year in which the 
study took place in order to be aware of any 
major changes in the organizations. None of the 
organizations, however, changed in size, purpose, 
structure, or in administrative procedures. There 
were some changes in personnel during the period. 
A total of five managers left the organizations. This, 
of course, meant some changes in who ranked certain 
people when the te data were collected. Some man- 
agers were ranked by different superiors and peers 


at tı and at te. 
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RESULTS 


In presenting the data collected, first the 
results concerned with the superior rankings 
will be considered, next those concerned with 
the peer rankings, and finally those concerned 
with the self ratings, Figure 2 therefore pre- 
sents the data for the cross-lagged correla- 
tional analysis for the relationships among 
the expectancy-attitude measures and the 
superiors’ rankings of job performance. Be- 
cause expectancy theory calls for a linear 
combination of each manager’s six reward- 
expectancy measures, in order to predict his 
job performance, multiple correlation coeffi- 
cients were used for Relationships 1, 2, 5; 
and 6. These four relationships represent all 
the instances where attitude measures were 
correlated with performance measures. Thus, 
all tests of causality were made by comparing 
the relative sizes of multiple correlation coeffi- 
cients. The correlation between the two per- 
formance measures is a Pearson product- 
moment while the correlation between the 
expectancy attitude measures is the average 
Pearson 7 for the six rewards considered. 

The correlations presented in Figure 2 be- 
tween the attitudes and the superiors’ per- 
formance measures bear the kind of relation- 
ship to each other that would be expected 
if the attitudes caused the performance. The 
lowest correlation is between the performance 
at ti and the attitudes at tə, while the highest 
correlation is between the attitudes at ti and 
the performance at to. In addition the correla- 
tions between the attitudes and performance 
data when the coefficients are computed from 
data collected at the same point in time fall 
between these two correlations. In order to 
compute the dynamic correlations between 
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Fic. 2. Cross-lagged correlational analysis for mea- 
sures of expectancy attitudes and superiors’ rankings 
of job performance (* p< 05; ** p< .01). 
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the changes in the attitude variables and the 
changes in the performance measure, a multi- 
ple correlation coefficient was used since 
changes in six attitude variables were being 
related to the changes in the performance 
measure. For the superiors’ rankings, this 
coefficient did show positive Beta weights for 
the predictors as would be expected if in- 
creases in the attitude measure were associated 
with increases in the performance measure 
but the coefficient did not reach statistical 
significance (R = 32). 

Figure 3 presents the cross-lagged correla- 
tional data for the analysis of the peers’ rank- 
ings of job performance. These data do not 
provide a close fit for the model that indicates 
the attitudes caused the performance. The 
relationship of Correlation 5 to Correlation 6 
Suggests that the attitudes caused perform- 
ance, but Correlations 1 and 2 do not fall in 
between these two as they would be expected 
to if this analysis were to provide strong 
support for the view that the attitudes caused 
the performance. The multiple correlation 
computed between the changes in the attitude 
measures and the changes in the performance 
measures was not significant although the Beta 
weights were positive (R = .41 Js 

Figure 4 presents the cross-lagged correla- 
tional data for the analysis of the self ratings 
of job performance. As was true for the peer 
ratings, this analysis is suggestive of the fact 
that the attitudes caused the performance but 
the fact that Correlations 1 and 2 do not fall 
between 5 and 6 prevents the data from per- 
fectly fitting the model of the attitudes caus- 
ing the performance, Nevertheless, the fact 
that the attitudes were better predictors of 
the performance in the future than the per- 
formance was of the attitudes in the future is 
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suggestive of the fact that the performance 
caused the attitudes. The dynamic correlation 
between the changes in the expectancy at- 
titudes and the changes in the performance 
measure was significant (R = .54, p < .01), 
and as was expected the Beta weights were 
positive. 


DISCUSSION 


The results of the study are generally sup- 
portive of the view that expectancy attitudes 
can best be thought of as causing performance. 
The cross-lagged evidence would seem to rule 
out the possibility that the frequently found 
relationship between expectancy attitudes and 
performance comes about because the perform- 
ance causes the expectancy attitudes. The fact 
that in all three cross-lagged analyses the 
attitudes at tı predicted the performance at 
te better than the performance at ti predicted 
the attitudes at tə would seem to provide 
good support for this point. Also supportive 
of this view is the fact that the attitude to 
performance correlations were generally higher 
for the predictions of the attitudes into the 
future than they were for the static correla- 
tions between the attitudes and the perform- 
ance at the same point in time. These latter 
results also suggest that one way previous 
studies might have been able to obtain a 
stronger expectancy attitude to performance 
relationship would have been to delay the 
collection of the performance data until some 
time after the attitude data had been collected. 

The data from the dynamic correlations is 
only suggestive of the fact that a third vari- 
able is not responsible for the relationship be- 
tween the attitudes and the performance. Only 
one of the three dynamic correlations was 
significant, yet all three generally took the 
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form that would be expected if changes in the 
attitudes led to changes in the performance 
without a third variable being involved. The 
data from the cross-lagged analysis suggest 
that a third variable does not simultaneously 
cause both the attitudes and the performance, 
but it does not rule out the possibility that a 
third variable first causes the attitudes and 
then the performance, or that the attitudes 
cause a third variable which in turn causes 
the performance. Overall, therefore, the data 
would seem clearly to rule out the possibility 
that the positive relationship found between 
performance and expectancy attitudes comes 
about because the performance causes the 
attitudes and is suggestive of the fact that it 
comes about because the attitudes caused the 
performance, 

Part of the reason that the dynamic cor- 
relations were not higher may well be found 
in the time relationships that may occur when 
expectancy attitudes cause performance. It is 
reasonable to expect that there might be a 
fairly long time lag between changes in ex- 
pectancy attitudes and changes in perform- 
ance measures. This would seem to be par- 
ticularly true where superior and peer rank- 
ings of performance are used. Even where at- 
titude changes almost directly lead to perform- 
ance changes, it might be some time before 
the performance changes could actually be 
expected to show up in the rankings of the 
performance. This would be expected because 
of the time needed to convince the raters 
that a real change has actually taken place 
and because of the time needed for the raters 
actually to become aware of the changes 
Some support for this thinking is provided 
by the fact that the dynamic correlation was 
significant for the self ratings but not for the 
peer and superior rankings, since presumably 
the self ratings would change more quickly. 
If a significant amount of time does in fact 
actually elapse between changes in expectancy 
attitudes and consequent changes in per- 
formance, then dynamic correlations may 
never show strong results where these changes 
are considered. In summary, it may be that 
the failure to find high dynamic correlations 
is not so much due to the fact that a third 
causal variable is involved, but the fact that 
there is a significant time lag built into the 
causal relationship. 
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It is impossible at this time to say exactly 
what the time sequence actually should be 
between the two measurements of the at- 
titudes and performance variables if the causal 
relationship is to be tested most effectively. 
The I-yr. separation chosen in the present 
study was an arbitrary time, hence future re- 
search should consider collecting data with 
differing intervals between the two collec- 
tion points. A logical next step would appear 
to be a study where some data are collected 
with a 6-mo. separation, some are collected 
with a 1-yr. separation, and some are collected 
with a 2-yr. separation. By comparing the 
results obtained for the varying periods of 
separation it should be possible better to 
understand the time sequence that is involved 
in any causal relationship between expectancy 
attitudes and performance and to test for the 
causal relationship better. The correlations 
should be highest at the point in time which 
most closely approximates the actual time 
it takes for A to affect B. 

Overall, the results obtained from the two 
correlational-causal approaches that were used 
are sufficiently interesting to suggest that 
this approach can be useful in the study of 
many other topics where causal priorities are 
in dispute. For example, the relationship be- 
tween satisfaction and performance could be 
studied with this approach as could the rela- 
tionship between participant leadership styles 
and employee involvement. This, of course, 
does not mean that this type of analysis can 
ever replace traditional experiments as the 
best way to establish causalities since experi- 
ments do allow causal questions to be an- 
swered with a higher degree of confidence. 
However, the causal-correlational approaches 
do have some advantages that argue they 
deserve more widespread use than they have 
received. They have some of the advantages 
of the field experiment without many of the 
disadvantages. That is, data from which causal 
inferences can be made are obtained for real 
situations, as in the field experiment, but 
many of the ethical, cost, and feasibility 
problems that go with field experiments are 
not likely to occur. 
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This study investigated the relationship between motivator and hygiene satis- 
faction variables to job involvement. Ss were 96 civil service supervisors who 
completed the Wernimont job-satisfaction scale and a job-involvement measure 
developed by Lodahl and Keiner. The results of the study were that motivator, 
but not hygiene, satisfaction variables correlated with job involvement, In 
addition, total motivator satisfaction scores accounted for considerably more 
variance in overall job satisfaction than did hygiene variables. 


Although job involvement has been related 
to turnover (Bass, 1965), the determinants of 
job involvement have not been investigated. 
It is hypothesized that job satisfaction is one 
such determinant. The emphasis placed on 
job-satisfaction variables has resulted in a 
relative neglect of the job- involvement vari- 
able. This is perhaps due to a lack of con- 
ceptual differentiation between job satisfac- 
tion and job involvement, and to an apparent 
failure to realize that it is possible for some 
persons to be highly satisfied, but not in- 
volved, and for others to be highly involved, 
but not satisfied. Moreover, some sources of 
job satisfaction are probably more likely to 
be related to job involvement than are others. 
It was the purpose of this study to investigate 
the differential relationships between various 
sources of job satisfaction and job involve- 
ment. 


Job Involvement 


The definition of job involvement is com- 
plex. Lodahl and Kejner (1965) consider it 
to be indicative of an individual’s work com- 
mitment. Bass (1965) views it as representa- 
tive of the employee’s ego involvement in his 
job and thus relates it to performance. Bass 
(1965) finds that the following conditions 
lead to the strengthening of job involvement: 
opportunity to make more of the job de- 
cisions, the feeling that one is making an 
important contribution to company success, 

1 This research was supported in part by a United 
States Public Health Service Fellowship No. 1-F1- 
MH-33, 376-01 (BEH-A) from the National Insti- 
tute of Mental Health awarded to the senior author. 


recognition, achievement, self-determination, 
and freedom to set one’s own work pace. 
March and Simon (1958) suggest that job in- 
volvement may also be related to the decision 
to participate and the decision to produce. 
Among other things, both of these decisions 
are affected by the individual’s perception of 
a variety of alternatives. The person who is 
more job involved may perceive fewer avail- 
able alternatives; he may, therefore, be in- 
clined to participate more actively in the re- 
quired activities of the organization. Katz 
and Kahn (1966) state: First, that job in- 
volvement is a necessary condition if the in- 
dividual is to accept fully the organizational 
demands placed upon him by his member- 
ship in an organization; second, that the de- 
gree of job involvement is related to level of 
aspiration and to the degree of internalization 
of organizational goals; and third, that job in- 
volvement is a moderator variable in the rela- 
tionship between satisfaction and performance, 
for only among involved employees does a 
positive relationship between satisfaction and 
performance become evident. Thus job in- 
volvement can be considered as an important 
measure of organizational effectiveness that 
may be, at least in part, influenced by job 
satisfaction. 


Job Satisfaction and the Two-Factor Theory 


Herzberg, Mausner, and Snyderman (1959) 
formulated the “two-factor theory of job 
satisfaction.” Their research concluded that 
there is one set of conditions the presence 
of which induces increased job satisfaction, 
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but the absence of which does not induce job 
dissatisfaction; and that there is another set 
of conditions, the absence of which leads to 
job dissatisfaction but the presence of which 
does not lead to increased job satisfaction. 
They termed the first set of conditions “mo- 
tivators” since satisfaction with these con- 
ditions was “conceptualized as actuating ap- 
proach rather than avoidance behavior [Herz- 
berg et al., 1959, p. 114, italics added].” The 
second set were termed “hygienes” since these 
conditions appeared to contribute only to the 
prevention of an “unhealthy psychological 
work environment [Herzberg et al., 1959, 
p. 113].” The motivators are: recognition, 
achievement, advancement, responsibility, and 
the work itself. These variables have also 
been termed intrinsic or work- content vari- 
ables, because they are all presumably derived 
from performance of the job itself. The hy- 
giene conditions are: interpersonal relations 
with peers and with superiors, company policy 
and administration, superior’s technical com- 
petence, working conditions, and job security, 
Hygiene conditions have also been referred to 
as extrinsic or work-context conditions be- 
cause they are all derived from the environ- 
ment surrounding the job. Salary, depending 
upon whether or not it is contingent upon 
performance, may be either a motivator or a 
hygiene factor. 


Hypothesis 


In this study job involvement was con- 
sidered to be a quasi-indicator of motivation, 
Therefore, it appeared reasonable to test the 
Herzberg et al. (1959) generalization using 
job involvement as the dependent variable. 
The specific hypothesis which was formulated 
for this research is as follows: Satisfaction 
with motivator sources will be related to in- 
creased job involvement, but satisfaction with 
hygiene sources will be unrelated to job in- 
volvement. 


METHOD 
Organization and Subjects 


The Ss were supervisors of a state (civil service) 
department normally employing about 5,000 persons, 
They ranged in rank from first-line supervisors 
(G 17) to the highest nonappointive position in the 
department, the executive director (G 38). 

One hundred seven persons initially took part in 
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the study. However, all Persons did not complete 
every one of the instruments, and some, although 
completing each instrument, left some items blank 
Female employees were removed from the sample 
The final sample consisted of 96 male supervisors 
who were between 40 and 62 yr. of age. Their 
salaries ranged from $7,700 to $17,300. They super- 
vised from 0 to 18 persons (some were supervisors 
in name only). The Ss had a range of 8-35 yr. of 
service. The educational level of the supervisors was 
fairly high, and ranged from 12 to 18 yr. of formal 
education. Most of the supervisors had a background 
in accounting or finance. 


Administration Procedures and I nstruments 


A questionnaire was distributed and each partici- 
pant was allowed about 1 wk. to complete it. The 
questionnaires were collected at group sessions dur- 
ing which research instruments for another study 
were being administered. All precautions to preserve 
anonymity were carefully observed. 

Wernimont Job Satisfaction Scale? Wernimont's 
(1966) scale, which was devised to measure present 
satisfaction according to a model constructed by 
Herzberg et al. (1959), was used in this study. For 
example, satisfaction with security is measured by 
the following set of statements: 


— I feel happy with the amount of security I 
have on my job. (5) 

— I feel somewhat happy with the amount of 
security I have on my job. (4) 1 

—— I feel that the amount of security on my job 
is okay. (3) 

am not too happy with the amount of 
security I have on my job. (2) 9 

—— Iam not happy with the amount of security 
T have on my job. (1) 


(The numbers in the parentheses following each 
item indicate the scale value of that item.) 

The respondent is instructed to check two state- 
ments from each variable set which best describe his 
Present satisfaction. The satisfaction score for each 
source is the sum of the two items selected. The 
higher the Score, the greater the degree of satis- 


faction. 
Similar sets of statements are used to measure 
motivator satisfaction (recognition, achievement, 


work itself, advancement, and responsibility) and 
hygiene satisfaction (department policy, technical 
competence-superior, interpersonal relations-superior: 
working conditions, interpersonal relations-peers, 
and, here, satisfaction with pay). Soa 

One additional item asks the respondent to indi- 
cate his present overall job satisfaction on a 9-point 
scale (9 being most satisfied). 

Job Involvement (JI) Scale. The scale used to 
Measure job involvement was initially developed by 
Lodahl and Kejner (1965). Originally, a 40-item 
scale was developed; this was then reduced to 20 
„ 


2 This scale is copyrighted by P. F. Wernimont 
and was used with his kind permission. 
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items “by considering the item-total correlations, 
the communality of an item, and the factorial clarity 
of the item [Lodahl & Kejner, 1965, p. 28].” Lodahl 
and Kejner established the split-half reliability and 
the concurrent validity of this 20-item scale by ad- 
ministering it to three occupational groups: nurses, 
engineers, and students. Construct validity was estab- 
lished by relating job-involyement score to Ghiselli 
Self-Description Index, Smith’s Job Description In- 
dex, and the Ohio State Leader Behavior Description 
Questionnaire scores within the various groups of 
Ss employed (Lodahl & Kejner, 1965, p. 31). 

In order to develop a shorter JI scale for use when 
time and space are limited, Lodahl and Kejner se- 
lected six items from the 20-item scale on the basis 
of a principal components analysis of the data from 
the engineering and nursing samples. The split-half 
correlation of this scale was reported to be .57 and 
its reliability was estimated to be .73. The correla- 
tion between this scale and the 20-item scale was re- 
ported to be .87. The shorter version of the JI 
scale was used in this study. 

Of the six items in the short version of the JI, the 
first five are scored in a positive direction, from 1 
(strongly agree) to 4 (strongly disagree). The last 
item is scored in reverse. A higher total score indi- 
cates lower involvement. Although this scoring ap- 
pears to indicate job alienation rather than job in- 
volvement, it resulted from the method used in the 
construction of the original JI scale (Lodahl & 
Kejner, 1965, p. 27). At that time, items were put 
into a Likert-type format and assigned scores of 1 
to indicate “strong agreement” and 4 to indicate 
“strong disagreement” with scores of 2 and 3 indi- 
tating intermediate positions. As the item selection 
process which led to the construction of the shorter 
6-item scale was continued, the original scoring direc- 
tion was maintained. The items comprising the JI 
scale are listed below: 


1, The major satisfactions in my life come from 
my work. 

2, The most important things that happen to me 
involve my work. 

3. Tm really a perfectionist about my work. 

4. I live, eat, and breathe my job. 

5. J am very much involved personally in my 


work, 
6. Most things in life are more important than 
work, 
Method of Analysis 


Responses to the items on the two instruments 
Were scored, and correlations were computed between 
Scores for each of the sources of job satisfaction and 
job involvement, Totals were also computed for the 
motivator and hygiene sources, respectively, by add- 
ing scores for the sources in each category. Correla- 
tions between these two totals and job-involvement 
Score were also calculated. Finally, the correlations 
between the satisfaction totals and overall satisfac- 
tion, and between overall satisfaction and job in- 
Volvement were also computed. 
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RESULTS 


Table 1 shows the correlations of the five 
motivator variables and the total motivator 
score with the JI scale score. Table 2 shows 
the correlations of the seven hygiene variables 
and the hygiene totals with the JI scale score. 

Table 1 shows that job involvement is re- 
lated to satisfaction with recognition, achieve- 
ment, and responsibility, although in each 
instance the percentage of total variance in 
job involvement accounted for is relatively 
small. The relationships between job in- 
volvement and satisfaction with work itself, 
and between job involvement and advance- 
ment, are not quite significant at the .05 level. 
These results confirm the hypothesis: Job in- 
volvement increases with increasing levels of 
satisfaction with motivator variables. 

Table 2 shows that, as predicted, job in- 
volvement is unrelated to all but one of the 
hygiene variables: satisfaction with inter- 
personal relations with the superior. This 
finding lends further support to our hy- 
pothesis since it indicates that, in general, 
satisfaction with hygiene variables seems to 
be unrelated to job involvement. 

Tables 1 and 2 also show that, as predicted, 
the motivator total satisfaction score cor- 
relates significantly (—.30) with job involve- 
ment. However, as expected, the correlation 
between the hygiene total satisfaction score 
and job involvement is not significant. Al- 
though the difference between these two cor- 
relations, and between the percentage of vari- 
ance explained in each case, is not great it 


TABLE 1 


CORRELATIONS OF MOTIVATOR VARIABLES AND 
TOTAL WITH JOB INVOLVEMENT 


A Correlation with job 
Satisfaction variable l 


involvement 

Recognition (a) —.25* 
Achievement (b) —.21* 
Work Itself (c) —.19 
Advancement (d) —.19 
Responsibility (e) —.27** 

Motivator total 

(scores abg... +e) — .30** 


Note.—N = 92; 4 Ss did not respond to the JI scale. 
* p <.05, df = 90. 
** p S. 01, df = 90. 


TABLE 2 


CORRELATIONS OF HYGIENE VARIABLES AND 
TOTAL WITH JOB INVOLVEMENT 


Satisfaction variable emir tier job 
Security (f) 03 
Salary (g) —.13 
Policy (h) —.10 
Technical Competence- 

Superior (i) —.15 
Interpersonal Relations- 

Superior (j) —.22* 
Working Conditions (k) —.16 
Interpersonal Relations- 

Peers (ö) 04 

Hygiene total 
= scores f-+g+. . . +1) —.18 


Note. — v = 92; 4 Ss did not respond to the JI scale. 
*p <.05, df = 90. 


does lend further support to the hypothesis 
of this study, 

Table 3 presents the correlation of the 
motivator and hygiene totals with overall 
satisfaction and also indicates the percentage 
of total satisfaction variance accounted for. 

Table 3 shows that the total motivator 
satisfaction and the total hygiene satisfac- 
tion scores correlate significantly with overall 
satisfaction scores, There is a relatively large 
difference in the Percentage of total variance 
in satisfaction accounted for by each of these 
variables. 


DISCUSSION AND CONCLUSION 
Although the magnitude of the relationships 
was not large, increased job involvement did 


TABLE 3 


CORRELATIONS OF MOTIVATOR AND HYGIENE 
SATISFACTION TOTALS WITH OVERALL 


SATISFACTION 
Correlation with 
Total overall % of variance 
satisfaction 

Motivator 
(2 scores on motiva- 

tors) 1701 49 
Hygiene 
(2 scores on 

hygienes) .60* 36 

Note.— V = 96, 


*p <.01, df = 90, 
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appear to be related to satisfaction with 
motivator variables. It therefore appears that 
the distinction between motivator and hy. 
giene variables introduced by Herzberg et al, 
(1959) can be useful in predicting the job 
involvement of civil service supervisors, At 
the same time, two motivator variables, satis. 
faction with work itself and satisfaction with 
advancement, were not significantly related to 
job involvement. Advancement in the civil 
service, however, is based largely upon sen- 
iority and performance on competitive ex- 
aminations. It therefore seems unlikely that 
this variable would necessarily function as a 
motivator. 

Tn addition to the findings which supported 
the initial hypothesis, the results (see Table 
3) concerning the relationship between overall 
job satisfaction and the total motivator and 
the total hygiene satisfactions seem to shed 
further light on the controversy surrounding 
the Herzberg et al. (1959) model of job satis- 
faction. The extent of this controversy is il- 
lustrated by two recent reviews of research 
relevant to the two-factor theory. In one such 
review, House and Wigdor (1967) reached 
the conclusion that the two-factor theory 
basically has no merit. In another review, 
Whitsett and Winslow (1967) came to the 
opposite conclusion and indicated that this 
approach to understanding job satisfaction 
has a good deal of merit. The majority of 
studies critical of the two-factor theory 
reached the consensus that both hygiene and 
motivator variables contribute linearly to 
overall job satisfaction. In the present study, 
this conclusion was also borne out, since total 
hygiene satisfaction and total motivator satis- 
faction both correlated significantly with over- 
all satisfaction. However, total motivator 
satisfaction accounted for 49% of the variance 
in overall satisfaction, while total hygiene 
satisfaction accounted for only 36% of this 
variance, Although this study was not de- 
signed to test that hypothesis fully, the 
present findings seem to lend some support to 
the two-factor model of Herzberg et 
(1959). The major focus of this study sis 
on the consequences of motivator and hygiene 
satisfactions for job involvement, and here, 
also, the results were supportive of the mo- 
tivator-hygiene model. 
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TRAINING FOR VIGILANCE: 
COMBINED CUEING AND KNOWLEDGE OF RESULTS? 
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To test transfer of training in a visual monitoring task, 44 Ss were given a 
48-min. training session in 1 of 4 conditions forming a 2 X 2 factorial design. The 
4 training conditions were knowledge of results (KR), cueing (also known as 
prompting), KR and cueing, and a control group receiving neither training aid. 
7 days later all Ss performed the same task with no training aids. Results 
showed that the KR-trained groups detected significantly more signals during 
transfer, but the cueing groups showed no significant differences. The group 
trained with the combination of KR and cueing did no better than the KR-only 


group. With respect to commissive errors (false alarms), there was no sig- 
nificant difference due to KR treatments, but the groups receiving cueing during 


training made significantly fewer during transfer. 


All 4 groups showed a 


decline in detection performance over time periods in the transfer session. 


In an effort to elevate detection perform- 
ance in vigilance tasks, several experimenters 
have turned to transfer-of-training techniques 
employing immediate feedback, or knowledge 
of results (KR), as a training aid. For a 
variety of monitoring tasks, it has been shown 
that operators trained with KR continue to 
outperform those trained without it (NKR) 
in later sessions when the KR has been re- 
moved (Adams & Humes, 1963; Hardesty, 
Trumbo, & Bevan, 1963; Mackworth, 1964; 
Wiener, 1963, 1967). Transfer of training in 
monitoring has been reviewed by Wiener 
(1967). 

Recently another training aid has made a 
bid for attention: This method is known 
variously as guidance, prompting, or cueing. 
While KR is response contingent and occurs 
after a signal, missed signal, or false alarm, 
cueing takes place before the onset of the 
signal, alerting the operator that it is about 
to appear on his display. The leading pro- 
ponent of cueing in vigilance tasks is Annett, 
who states that cueing is superior to KR as 
a training device. Annett and Clarkson 
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and Industrial Health. The authors gratefully 
acknowledge the assistance of Mary Dutcher and the 
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Engineering School, University of Miami. 
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(1964) found superior transfer in the cueing 
group compared to the KR group, but Annett 
and Paterson (1966) found no differences. 
Annett (1966, Experiment III) also reports 
cueing superior to KR in the transfer session; 
however, in that study the KR was not im- 
mediate feedback, but summary information 
provided every 5 min. In a recent review of 
five of their experiments, Annett and Pater- 
son (1967) conclude that both KR and cue- 
ing increase detections, but that KR also 
tends to increase false reports. Finally, Col- 
quhoun (1966) attempted to compare the 
transfer effects of various cueing and KR 
training schemes, but failed to find any differ- 
ences between the experimental and control 
groups. Training with cueing and KR has 
been reviewed by Aiken and Lau (1967) and 
Lau (1966), who conclude that the benefits of 
cueing as a training device may have been un- 
derestimated, relative to KR, due to the 
prominence of the law of effect in learning 
theory. 

This experiment sought to investigate the 
relative merits of cueing and KR in achieving 
Successful transfer of training in visual moni- 
toring, and particularly to examine whether 
any advantage could be obtained by combin- 
ing the two. The experiment is essentially à 
2 x 2 factorial design, consisting of the four: 
fold combinations of KR and NKR, cueing 
and no cueing. 


TRAINING FOR VIGILANCE 


METHOD 
Subjects 


The Ss were 44 undergraduate students in ele- 
mentary psychology courses, 28 male and 16 female 
They were recruited for the experiment with the 
offer of an hour's credit toward their experimental 
requirement or $1.50 for each of the two sessions. 
The Ss were randomly assigned to groups with the 
exception that the male: female ratio was kept 
balanced. The groups are designated by their training 
conditions: KR (KR, no cueing), Q (cueing, no 
KR), KR +Q (both training aids), and N (neither 
training aid). 


Apparatus 


The task was an adaptation of one first used by 
Jenkins (1958), in which a voltmeter needle made 50 
rightward deflections of 20°/min, the needle deflect- 
ing rapidly and returning to its resting position 
before the next deflection. The signal was a larger 
(30°) deflection. The meter faceplate was painted 
flat black and the needle white, and Ss viewed the 
meter on small closed-circuit television monitors in a 
toom separated from the apparatus. They responded 
to signals by pressing a hand-held cord switch. The 
deflections, both background and signal, and cueing 
were scheduled on punched paper tape, stepped 
through a reader at 50 characters per minute. The 
signal schedule consisted of 32 signals occurring ran- 
domly during a 48-min. vigil, with the restriction 
that eight signals occurred during each 12-min. 
period, and the minimum intersignal interval was .3 
min, KR was provided automatically by a logic 
circuit which determined whether any response was 
à detection during the allowable time (2.5 sec.) or a 
commissive error. This information was displayed on 
a small lightbox located over the television sets. De- 
tections resulted in a green light, commissive errors 
in red, and nondetections were indicated by an 
amber light which remained on for 5 sec. after the 
allowed response period. The box is similar to that 
shown in a previous study (Wiener, 1963, Figure 1). 
The KR box was removed from the booth during 
training for Group Q and Group N and during all 
transfer sessions. All Ss wore earphones which played 
White noise from a Grason-Stadler Model 455C gen- 
trator, to achieve auditory isolation. 

Cueing was provided in the following manner: 3.6 
‘ec. prior to a signal, the white noise was interrupted 
and a loud 1000-cps tone was presented in its place 
for 2.7 sec. The tone ended just prior to the signal 
deflection. The group receiving both cueing and KR 
(KR +Q) was simply provided with both the audi- 
‘ory cueing and the visual KR, and since they 
were separated by the signal and a 2.5-sec. response 
Petiod, there was obviously no conflict or confusion 
etween the two. 


Procedure 


The Ss received initial instruction on the task 
and special instruction on KR and cueing, as ap- 
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propriate to their group. The Ss who did not receive 
cueing, KR, or both were not aware of its existence 
in other groups. Following the instruction and dem- 
onstration period, a standard 48-min, training session 
was run. The Ss were reminded to return at the 
same time a week later, and on their return all Ss 
performed another vigil with neither cucing nor 
KR. The Ss in the groups that had received one (or 
both) of the training aids were simply told that it 
would not be present for this session. The signal 
schedule for the transfer session was the same as 
the previous session, but run in reverse, provid- 
ing the same intersignal intervals with a different 
order. The data collected were detections and com- 
missive errors (false alarms) by 12-min. periods. 


RESULTS 


The data collected consisted of the number 
of signals detected by each S during each of 
the four 12-min. periods, and commissive 
errors by periods. A commissive error was 
any response not preceded (within 2.5 sec.) 
by the presentation of a signal. 


Detections 


The detection data by groups and periods 
for the two sessions are shown in Figure 1 in 
terms of percentage of signals detected. In 
the transfer session all Ss were run under con- 
ditions of no feedback and no cueing (identi- 
cal to Group N in the training sessions), but 
are labeled according to the conditions under 
which they were trained a week earlier. The 
detection data for the two sessions were 
analyzed separately as two partially hier- 
archal analyses of variance, with Ss nested 
in one of the four KR-Cueing treatments, but 
common to the four time periods during a 
session (Winer, 1962). The data were first 
subjected to Cochran’s test for homogeneity 
of variance, and in neither analysis was the 
null hypothesis of equal variances rejected. 
The analysis of variance for the training ses- 
sion is shown in Table 1, and for the trans- 
fer session in Table 2. 

Table 1 indicates that both training vari- 
ables affected detection rate. Of course, one 
should place little importance on the signifi- 
cance of the cueing variable during the train- 
ing session, as there is no reason why the cue- 
ing groups should have anything less than 
nearly perfect performance. The difference 
between Group Q and Group KR + Q was not 
statistically significant, according to a Duncan 
multiple-range test applied to the pair-wise 
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12 MINUTE BLOCKS 


Fic, 1, Percentage of signals detected during training and transfer sessions as a function 
of training conditions and time periods. 


differences between the four training condi- 
tions, but all other pair-wise differences were 
statistically significant. The fact that the KR 
main effect was Statistically significant, as 
was the pair-wise difference between Group 
KR and Group N (p< 05), substantiates 
the familiar finding that KR produces su- 
perior monitoring performance. The signifi- 
cance of the KR X Periods interaction fur- 
ther supports previous contentions (Wiener, 


TABLE 1 


ANALYsIs OF VARIANCE OF SIGNALS DETECTED 
DURING TRAINING SESSION 


Source 


Between Ss 
KR. (A) 
Cueing (B) 
AXB 
Error, 

Within Ss 
Periods (C) 
AXC 


1963) that KR groups often show no decline, 
and often even an increment in performance 
at the same time that the control group shows 
the familiar vigilance decrement. 

The analysis of variance of the transfer se 
sion clearly indicates the transferability of 
differences produced by training with knowl 
edge of results, while the differences produced 
by providing cueing in the training session, 
extreme as they were, failed to transfer after 


TABLE 2 


ANALYsIs OF VARIANCE OF SIGNALS DETECTED 
DURING TRANSFER SESSION 


Source MS F 
— 

Between Ss 43 F 
KR. (A) 1 72.55 6.71 
Cueing (B) 1 0.14 SN 
AXB 1 4.78 oF 
Error, 40 10.82 

Within Ss 132 
Periods (C) 3 21.52 9.70" 
AXC 3 5.61 2.52 
BNC 3 0.32 = 
AXBXC 3 26 a 
Error 120 2.22 

Total 175 


â Knowledge of results. 
bd 02. 
* <.001. 
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a 7-day intervening period. Figure 1 indicates 
that cueing did not affect the performance of 
the group during the transfer session, and 
pair-wise range tests show no difference be- 
tween Groups KR and KR +Q, or Groups 
Qand N. The significance of the period’s main 
elect, coupled with the nonsignificance of 
interactions of the training variables with 
periods, points out that training techniques 
affect the overall performance level of the 
groups after transfer, but fail to prevent vigi- 
lance decrement. 


Commissive Errors 


Since commissive errors characteristically 
show an extreme positive skewness, the au- 
thors have adopted the practice of analyzing 
between-group differences of the number of 
$s above or below the overall median number 
of commissive errors (see Wiener, 1967). 
Thus, the total number of commissive errors 
for each S was computed, and the median of 
these for the four groups combined was found 
to be 0.0 for the training session and 1.0 for 
the transfer session. The between-groups con- 
trasts were then analyzed by forming a con- 
tingency table with groups as one dimension 
and the frequency of Ss above and below the 
median as the other dimension. The Ss mak- 
| ing exactly the median number of commissive 

errors were evenly divided into the frequency 
categories. The data were then analyzed us- 
ing the chi-square test for rom and column 
independence. 

Since commissive error rates are meaning- 
less while cueing is being provided to Ss, 


TABLE 3 
Tota Cosnmsstve Errors IN TRANSFER SESSION 


Group* 
Period Total 
KR N 
£0 KR | O 
— aA — 
1 4 6 1 10 2¹ 
2 3 10 7 7 27 
3 2 15 4 6 27 
4 1 10 6 il 28 
Total 10 41 18 34 103 


* Abbreviations: = 1 results, no cueing; 
Da cueing, no KR owes oth training aids; N = 
ther training aid, 


m 


TABLE 4 


Nowsze or Sceyrers Marmo Less max, Equat to, 
Axo Geeatee max rue Ovezate Ment 
Cu Esxzoss 


Grou 
TAAT E ů— ia 
KR+Q| KR Q | N 
0 7 2 5 | 2 
1 2 4 r 
>I 2 5 3 8 
Total i 11 1 | il 


in the training session only the KR and N 
groups were tested. Of the 22 Ss receiving 
cueing, 19 made no commissive errors, and the 
remaining 3 made a total of 10. Con- 
sidering only the KR and N groups, the 
overall median was three commissive errors, 
and the analysis of the contingency table re- 
vealed a nonsignificant difference (x? = 2.2, 
df = 1), indicating that KR did not affect 
the false-alarm rate. 

The raw commissive error data for the 
transfer session are shown in terms of total 
commissive errors by groups and time periods 
in Table 3, and by numbers of Ss making 
less than, equal to, or greater than the median 
in Table 4. Median tests revealed that over- 
all the four groups were not significantly dif- 
ferent (x? = 4.25, dj = 3), nor was the con- 
trast of the two KR groups versus the two 
non-KR groups (x7=0.21, df=1). The 
analysis of the two cueing versus the two non- 
cueing groups indicated a significant differ- 
ence (x? = 3.84, dj = 1), with the combined 
cueing groups containing more Ss below the 
median number of false alarms. 


DISCUSSION 


The results of this study are further evi- 
dence that the most effective way to train a 
monitor for a monotonous watchkeeping job 
is to train him with immediate knowledge of 
results, and lend no support to the conten- 
tions of Annett and his collaborators that 
cueing, or prompting, is more effective. In 
fact, for this task, the authors were unable 
to find that cueing was effective at all, and 
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combining cueing with KR offered no im- 
provement over the straight KR training 
method. To be sure, there are great differences 
between the auditory discrimination tasks 
used by Annett and his collaborators and the 
simple visual task employed here, and on a 
purely speculative basis the authors would 
agree with Lau (1966) that cueing may be- 
come more effective as the perceptual com- 
plexity of the task increases. In a very sim- 
ple one-dimensional monitoring task, there is 
so little to be learned about the nature of the 
signal itself that the benefits of cueing, per- 
haps what Annett and Paterson (1967) view 
as providing the S with “an image or template 
against which further instances can be as- 
sessed,” are minimized. They also regard KR 
not as a reinforcer, but as an inefficient form 
of retroactive cueing. 

However, for a simple and monotonous 
monitoring task, the authors prefer to inter- 
pret the benefits due to KR in traditional law 
of effect terms, stressing more the motiva- 
tional than the information component of 
KR. In very simple terms, S learns in the 
training session to maintain a higher level of 
alertness due to the reinforcing properties of 
KR, and he carries this tendency to the 
transfer session. The $ may indeed obtain 
some retroactive cueing from the KR and 
learn something about the nature of the 
signal, but this should be viewed as a 
bonus dividend of KR and not its primary 
payoff. There is a strong possibility that S 
actually learns little or nothing as long as 
cueing is being provided, either because he 
may become cue dependent” and pay little 
attention to the character of the signal itself, 
or because he may actually regard cueing as 
annoying. Some Ss in this study volunteered 
unfavorable comments about cueing, describ- 
ing it as irritating and “taking the challenge 
out” of the training session. 

In summary, with the exception of the 
finding that the cueing groups produce sig- 
nificantly fewer commissive errors than the 
noncueing groups (as also reported by Annett 
& Paterson, 1967), the results of the present 
study, and of Colquhoun (1966), offer no sup- 
port for cueing as a training aid in watch- 
keeping. Annett’s strongest case is based not 
around detections, but the ability of cueing 
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to reduce commissive errors (which the 
authors confirm), and the tendency of KR U 
encourage commissive errors (which the ap. 
thors do not confirm, in this or in a number 
of other experiments of this type). One should 
recognize that the method in which Ss were 
prompted in this experiment was quite arbi- 
trary, and an entire array of questions on 
the most effective way of presenting the 
prompt remains to be answered. These ques- 
tions involve such things as the length of the 
prompt, the sense modality, the duration be- 
tween the end of the prompt and the onset of 
the signal, and a host of other hardware and 
software considerations. But based on the 
evidence to date, the authors are far more 
impressed by the potential of KR than any 
other training aids, as a practical solution to 
the many problems of man at watch. The 
authors will be eager to see if Aiken and Laus 
prophecy, that in the future there will be as 


much research on cueing as there has already) 


been on KR, will come to pass. 

Finally, Swets and his collaborators (Swets, 
Harris, McElroy, & Rudloe, 1964; Swets, 
Millman, Fletcher, & Green, 1962) have ex- 
amined the efficacy of computer-aided instruc- 
tion in learning auditory identification, and 
Weisz and McElroy (1964) have found si- 
perior performance in visual identifications in 
a group combining cueing and KR. Noted 
recently is Mackie and Christensen’s ( 1961) 
lament over the lack of practical application 
(to training) of the findings of experimental 
psychologists, and their call for “learning 
engineers” to tackle the problem. It appears 
that the monitoring task is ripe for the imple- 
mentation of new technologies, and that the 
tools, ranging from hardware such as com- 
puter terminals and teaching machines, to 
software such as cueing, KR, signal schedules, 
and various tangible rewards, are readily at 
hand. 
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Measures of the traits of intelligence, supervisory ability, and self-assurance, 
together with measures of the desires for job security, high financial reward, 
self-actualization, and power were obtained on 271 managers. In order to ascer- 
tain whether the motivational factors operated as moderators, on each of the 
motivational factors the managers were divided into those for whom the moti- 
vation was strong and those for whom it was weak, and separately for each 
group the relationship between scores on the 3 trait measures and job success 
was determined. It was found that in 14 of the instances the relationships 
between the trait measures and managerial success were significantly moderated 
by the measures of motivation thus indicating that there are interactive effects 
between traits and motivational factors in the determination of the success of 


managers, 


Measures of various sorts of traits, both 


traits of ability and traits of personality, hay? 


been found to be related to the degree of suc- 
cess attained in management positions. Vari- 
ous sorts of motivational factors, too, have 
been shown to play a part in determining 
managers’ performance. While it seems an ob- 
vious point, the fact of interaction between 
traits on the one hand and motivational fac- 
tors on the other has not been given much 
attention. Even though it is possible to ac- 
count reasonably well for variation in the job 
proficiency of managers in terms of the joint 
effects of traits and motivations in simple 
additive terms, because of the dynamic char- 
acter of motivation it would appear likely that 
some model other than a linear combina- 
tion of variables would be much more effec- 
tive. 

Unfortunately the term interaction is not 
easily defined, though obviously it implies 
that the joint effects of two or more variables 
are not of some simple additive sort. Cer- 
tainly different sorts of methodologies reveal 
different kinds of interactive effects. The in- 
teractive effects disclosed by analysis of vari- 
ance give one sort of result, and the multipli- 
cative rather than the additive combination of 
scores on two or more variables provides still a 
different indication. Recently there has been 
interest in the interactive effects as shown by 
moderator variables. Each of the three ap- 


proaches provides pertinent information, but 
at the same time it cannot be said that any 
one is more effective than any other in indi- 
cating interactions. Likely they all show dif- 
ferent sorts of departures from simple linear 
combinations of variables. 

A proposition that generally is taken as 
basic in psychological measurement holds that 
errors of measurement and errors of predic 
tion are completely random. It follows fro 
this proposition that there are no reliable 
differences among individuals in the extent of 
these errors, and hence any apparent individ- 
ual differences in such errors necessarily must 
be unrelated to all other variables. However, 
it has become increasingly apparent that this 
state of affairs is by no means invariably the 
case, for in numerous instances it has been 
found that the degree of relationship between 
two variables, an indication of the magnitude 
of the errors of measurement and of predic 
tion, is in fact a function of a third variable 
(Ghiselli, 1963a). That is, when individuals 
are sorted out into subgroups on the basis 0 
the scores they earn on one variable, the de- 
gree of relationship between the scores they 
earn on two other variables varies system 
atically over the subgroups. A variable whi 
operates in this fashion, interacting with tw? 
other variables in determining the degree © 
relationship between them for different indi- 
viduals, is termed a moderator. 
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There has been relatively little attention 
devoted to the nature of the variables which 
moderate the relationship between pairs of 
other variables. Intuitively, at least, some psy- 
chological properties seem to be rather 
“static,” whereas others give the impression 
of being dynamic.“ Thus traits such as the 
capacity to solve problems in syllogistic rea- 
soning or in arithmetic tend to be thought of 
as static traits, whereas motivational factors 
such as the desire to express one’s own indi- 
viduality or to dominate others are likely to 
be considered dynamic. Again intuitively, it 
seems probable that variables of the first sort 
would combine in simple additive ways, but 
that when variables of the latter sort are 
combined interactive effects might well be 
found. If this be the case, then it would be 
expected that measures of motivational fac- 
tors would moderate the predictive power of 
measures of traits. 

The traits of intelligence, supervisory abil- 
ity, and self-assurance seem to stand up as 
having significance to performance in sundry 
managerial positions (Ghiselli, 1963b). The 
telationships between scores on measures 

-of these traits and success in mana- 
gerial positions are modest, but none- 
theless seem to be consistently positive. It 
has also been found that three sorts of moti- 
vational factors, the desire for job security, 
for high financial reward, and for self-actuali- 
zation, also play a part in determining the 
success of managers (Ghiselli, in press). A 
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fourth motivational factor, the desire for 
power over others, apparently is unimportant 
in this respect. The purpose of the present in- 
vestigation is to ascertain whether or not these 
four motivational factors moderate the rela- 
tionships between the three trait measures on 
the one hand and the job success of managers 
on the other. That is, for those individuals who 
are strong in a given sort of motivation, is 
the relationship between scores on a trait 
measure and job success different from that 
for those individuals who are weak in that 
motivation? If differences of this sort do oc- 
cur, then it can be said that there are inter- 
action effects between traits and motivational 
factors in the determination of success in 
managerial positions. 


METHOD 


A forced-choice inventory was used which not only 


provides indexes of the traits of intelligence, super- 


visory ability, and self-assurance, but also of the 
motivational factors the desire for job security, for 
high financial reward, for power over others, and for 
self-actualization (Ghiselli, 1963b, in press). This in- 
ventory was administered to 271 persons holding 
middle management positions in a variety of different 
types of business and industrial organizations. For 
each manager a rating of his success in his own 
organizational setting was obtained. These ratings 
were dichotomous, each person was judged as being 
either a more or a less successful manager. Of the 
271 managers, 55% were classified in the upper cate- 
gory and 45% in the lower one, 

On each of the four motivational factors the 
managers were divided into the upper and the lower 
halves as nearly as could be achieved on the basis of 


TABLE 1 


COEFFICIENTS OF CORRELATION BETWEEN THE SCORES EARNED BY MANAGERS ON Five 
Tests AND THEIR JOB SUCCESS FOR SUBGROUPS FOR Wuom EACH or THE 


Four DIFFERENT MOTIVATIONAL 


FACTORS ARE STRONG OR WEAK 


Motivational factors 


Self-actualization 


Test Job security High financial reward | Power over others 
Strong Weak Strong Weak Strong Weak Strong 
Intelligence 11 or 26 05 05 25 47 
Supervisory ability 24 53 AL 42 45 30. 23 
If-assurance 48 .00** —.02 .36"* 35 12 06 
No. cases 134 137 101 169 154 147 121 


* Difference significant at p > .05. 
* Difference significant at p > .01. 
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their scores on the scales measuring those factors, 
Then separately for each of these two subgroups, 
those in whom each given motivation was strong and 
those in whom it was weak, the coefficient of corre- 
lation between scores on each of the three trait mea- 
sures and job success was determined. 


RESULTS 


The findings are shown in Table 1. Of 
the 12 differences between pairs of coefficients 
of correlation, 4 reach statistical significance 
at better than the .01 level of confidence and 
2 at better than the .05 level. The desire for 
job security clearly has moderating effects 
on the relationships between success and 
the traits of supervisory ability and self-assur- 
ance. For those managers who are strongly 
motivated toward job security the relation- 
ship between supervisory ability and success 
is lower than for those in whom that motiva- 
tion is weak. The opposite is the case for the 
trait of self-assurance. The desire for financial 
reward moderates the relationship between 
self-assurance and success, there being no re- 
lationship at all between self-assurance and 
success for those who are strongly motivated 
for such reward, and a fairly substantial re- 
lationship for those in whom such a motiva- 
tion is weak. The desire for power over others 
moderates the relationship between success 
and self-assurance, the relationship being 
higher for those managers for whom the de- 
sire for power over others is strong. Finally, 
the desire for self-actualization moderates the 
relationships between managerial success on 
the one hand, and both supervisory ability 
and self-assurance on the other, the relation- 
ships being higher for those for whom the 
desires are strong. 


Discussion 


All of the four types of motivational fac- 
tors function as moderators. The differences 
between pairs of coefficients of correlation are 
somewhat larger for job security than for the 
other three factors, so possibly job security is 
the motivational factor most likely to produce 
interaction between traits and success. The 
desire for power seems to have least im- 
portance as a moderator since for it the differ- 
ences between the pairs of coefficients appear 
to be smallest. Therefore different types of 
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motivational factors differ in their effective- 
ness as moderators. This means that in the 
determination of the success of managers, 
some sorts of motivations interact with traits 
pertinent to performance in managerial posi- 
tions to a greater extent than do others. Fur- 
thermore, some traits seem to be more sus- 
ceptible to interactive effects than do others 
None of the differences between the pairs of 
coefficients of correlations reaches acceptable 
levels of statistical significance in the case 
of the trait of intelligence, whereas all four 
motivational factors operate as moderators in 
the case of self-assurance. 

In a sense, the desire for security and the 
desire for self-actualization are opposite sorts 
of motivations. Those individuals who seek 
safety in their employment situations would 
be expected to avoid acting in creative ways, 
thereby inviting attention to themselves, 
whereas it is likely that those who seek to 
express their individuality and to use their 
talents to the fullest would not be concerned 
with security. The results of the present in- 
vestigation suggest that with managers who 
eschew either goal, job security or self-actuali- 
zation, supervisory ability plays a much more 
significant part in determining success. Only 
when either of these two sorts of motivational 
factors is relatively weak does the individ- 
ual’s ability to direct the activities of others 
assume a prominent role in determining his 
success in administrative and executive posl- 
tions. 

The four motivational factors dealt with 
in the present investigation can be classified 
into two general sorts. The desires for se. 
curity and for power might be thought of 45 
selfish motivations which protect and gratify 
the ego, and the desire for high financial re- 
ward and for self-actualization perhaps coul 
be conceived of as arising from individu 
strivings, the one being the reward expecte 
for individual action and the other the && 
pression of individuality. Looked at this way 
it then appears that when either the selfi 
motivations are strong or the individual ones 
are weak, then self-assurance assumes a para- 
mount role as a determiner of managerial sut- 
cess. That is, the belief in one’s self has # 
positive effect upon manager’s performance 
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only when his goals are selfish and are not an Giuse, E. E. The validity of management traits 
in relation to occupational level. Personnel Psy- 


ion of his individuality. 
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TASK SUCCESS, TASK POPULARITY, AND SELF-ESTEEM 
AS INFLUENCES ON TASK LIKING* 


ABRAHAM K, KORMAN 


New York University 


Report of 3 experiments which support the general hypothesis that there is a 
positive relationship between task success and task satisfaction for persons who 
hold themselves in high self-esteem but not for those whose self-esteem is 
low. Implications for a general theory of job satisfaction as stemming from the 
moderator influences of personal and task variables are discussed. 


Considerable interest in recent years has 
focused on the extent to which liking for a 
task or job is dependent on the extent to 
which one is successful in the given situation. 
Thus, while some argue that task success is 
important to job satisfaction but not neces- 
sary to job dissatisfaction (cf. Herzberg, 
Mausner, & Snyderman, 1959), others have 
claimed that such a distinction is too limited 
and that it is vital for both job satisfaction 
and dissatisfaction (cf. Dunnette, Campbell, 
& Hakel, 1967). Whichever theoretical ap- 
proach one adopts, and the evidence tends to 
support the latter somewhat more strongly 
than the former (cf. Dunnette et al., 1967), 
it would seem apparent that increasing in- 
terest is being taken in achievement as a pos- 
sible key determinant of the satisfactions one 
may derive from one’s job, particularly since 
experimental evidence continues to be re- 
ported which supports the hypothesis quite 
well (Locke, 1965, 1966). 

Despite such findings, however, some rea- 
son to question the proposed relationship in 
terms of its generality has been indicated. 
For example, in one of the studies (Locke, 
1965) supporting the basic hypothesis it is 
pointed out: 


The author would not predict that there is any 
automatic relationship between task success and task 
satisfaction; rather the relationship should be high- 
est in those cases where Ss are trying hardest to 
succeed and lowest (or nonexistent) in situations 
where they are trying least hard to succeed. If situ- 
ations could be devised where Ss were not trying to 
succeed, the author would predict that the correla- 
tion between success and satisfaction would be 


1 Portions of this paper were presented at the 
meeting of the Eastern Psychological Association, 
Washington, D. C., April 1968. 


lower than the correlation for the same task where 
the Ss were trying to succeed [p. 385]. 


Since a number of writers have emphasized 
that the desire not to succeed and not to do 
well is particularly likely to occur both as a 
function of the personality characteristic of 
the individual and of role demands and ex- 
pectations in lower levels of bureaucratic or- 
ganizations (cf. Argyris, 1964; Korman, 
1967), it would seem desirable that this pos- 
sible limitation of the task-success—task-liking 
relationship be tested. Should such a limita- 
tion be established, this would have important 
implications for those wishing to develop at 
adequate theory of those conditions which 
facilitate satisfaction with one’s employment 
situation. 

The major purpose of the experiments to 
be reported in this paper was to test this 
possible limitation to the task-success-task- 
liking relationship within a self-esteem bal. 
ance theoretical hypothesis to be describe 
below. In addition, as the experiments pf. 
ceeded, other hypotheses were develo 
which were deemed to be of relevance to the 
understanding of task satisfaction. These will 
be discussed at a later point in the paper. 


THEORETICAL FRAMEWORK 


The theoretical framework used in this fe. 
search stems from, but is not identical t0, 
conceptions that have been offered by Vroom 
(1964) and Lecky (1945), in that it is bas 
on a “self-consistency” hypothesis as to Satis 
faction, consistency leading to satisfactio" 
and inconsistency to dissatisfaction. Ihe 
difference between the framework used het? 
and their previous proposals lies mainly along 
three dimensions. First, it lays explicit heavy 
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emphasis on self-evaluation and self-percep- 
tion as part of the cognitive structure, and 
suggests different sources as to consequent 
variations in self-perceptions. Second, it in- 
corporates within a common framework con- 
ceptions of the self-deriving from past rein- 
forcement history as well as contemporary 
determinants of self-esteem (i.e., experimental 
manipulations and other interpersonal evalu- 
ations). Third, the basic theoretical assump- 
tion from which the derivation tested in this 
paper was taken attempts to predict varia- 
tions on the three major dependent variables 
of industrial psychology: work performance, 
work satisfaction, and work choice, within a 
single framework.” 

Thus, the major assumption of the frame- 
work is as follows: 
All other things being equal, individuals will (a) 
choose, (b) adjust their performance to, and (c) 
find satisfying those behavioral roles which will 


maximize their sense of cognitive balance or con- 
sistency. 


This being the case, then the following deriva- 
tion can be made: 


Individuals will tend to find most satisfying those 
job and task roles which are consistent with their 
self-cognitions. Thus, to the extent that an indi- 
vidual has a self-cognition of himself as a compe- 
tent, need-satisfying individual, then, to that extent, 
he will find most satisfying those situations which 
are in balance with these self-perceptions. 


It is assumed further that one’s self-esteem 
or self-evaluation is the extent to which he 
sees himself as a competent, personal, need- 
fulfilling individual. Differences in such self- 
evaluation may arise, for our purposes, in 
three ways. First, one may conceive of a 
telatively chronic level of self-esteem, that is, 
a relatively persistent personality trait that 
occurs relatively consistently across various 
Situations. Second, one may conceive of an 
individual's self-perceived competence con- 
cerning a particular task or job at hand. This 
May arise as a result of differential learning 
experiences or the specific characteristics of 
the moment. Finally, one’s self-esteem is also 
a function of the expectations which others 
have of us. As Tannenbaum (1962) has 
argued, to the extent that others think that 


— ei 


A. K. Korman, A Hypothesis Relating to Work 
Behavior, Unpublished manuscript, 1967. 
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(a) we are competent and need-satisfying and 
able and (>) they exhibit such thoughts by the 
kinds of behavior which they exhibit toward 
us, to that extent our self-perceived compe- 
tence concerning the task at hand is increased 
and so is, we would predict, task performance. 
The rationale for this is that such interper- 
sonal evaluation provides a base of “reality” 
along which we may guide our behavior, as 
well as providing the “reality” which we 
learn from our previous experiences (Deutsch 
& Gerard, 1955). 

From these considerations it was predicted 
in the experiments to be reported below that 
task success and task liking should be posi- 
tively related for high self-esteem and unre- 
lated for low self-esteem. Such a prediction 
was based on what seemed to be the reason- 
able assumption that success in a given situ- 
ation is equivalent to adequacy in that situ- 
ation and hence is in “balance” for high-self- 
esteem people, but not for low. In addition, 
as we have indicated, certain additional pre- 
dictions were made in the last two experi- 
ments, with these to be described at that time. 


EXPERIMENT I 
Method 


The tasks involved in this experiment were Sub- 
tests 1 and 2 of the Watson-Glaser Critical Thinking 
Appraisal, Form AM (Watson & Glaser, 1952), and 
12 words from the medium anagrams lists of Tres- 
selt and Mayzner (1966), (The subtests used are 
titled (1) Inference and (2) Recognition of Assump- 
tions.) The Ss were tested in groups of 15-20 and 
were presented the tasks as part of a normative study 
of college students’ opinions about various ways of 
measuring creativity and intellectual ability. “Suc- 
cess” and “Failure” were determined by E informing 
Ss of “pass-fail” points (based on scores from pre- 
tests) prior to asking them to make their ratings on 
a 5-point scale, but after they had completed the 
task. 

As in previous research by the author (Korman, 
1966, 1967), self-esteem was determined by scores 
on the Self-Assurance scale of the Ghiselli Self- 
Description Inventory,’ with the cutoff for “high” 
and “low” being the median on the nationwide 
norms. Since evidence for the construct validity of 
this scale is contained mostly in an unpublished 
manual, Tables 1 and 2, consisting of a series of 
studies by the author of this paper and others, are 
presented as partial evidence for such validity. An 
examination of the relationships which are signifi- 


s Edwin E. Ghiselli, Self-Description Inventory. 
Undated. 
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TABLE 1 


RELATIONSHIPS BETWEEN GHISELLI SELF-ASSURANCE SCALE AND OTHER PSYCHOLOGICAL MEASURES 


Highs Lows 
Sample Measure — — 

N| M |N| M 
Engineering students | Gough Adjective Check List, Self-Confidence Scale 14 | 52.6 | 20 |46.6 05 

Industrial foremen Miner Sentence Completion Scale (a projective test of 

organizational power orientation) 10 | 4.70 | 12 60 05 
Business students Crites Need for Social Service (Likert-type scale) 35 5.77 | 36 | 7.97 | .05 
Business students Crites Need for Job Freedom (Likert-type scale) 35 | 9.60 | 36 | 8.22 | .10 


Industrial foremen 
and tasks* 
Industrial foremen 
during teens“ 
Liberal arts students 
Business students 


Frequency with which parents supervised Ss’ jobs 
Frequency with which Ss argued with their parents 


Marlowe-Crowne Social Desirability Scalee 89 
Bass Self-Orientation (forced-choice tetrads) 20 


27.10 | 15 | 24.06 | .10 


Note.—Division between highs and lows was based on the median of national norms, All tests of significance are two-tailed. 


a Low score indicates greater frequency. 
b High score indicates greater frequency. 


»The author is indebted to Virginia Dunda and Charles Miller for these data; r = .16, us. 


cant and those which are not, as presented in the 
two tables, seems to offer support for the interpreta- 
tions which are placed on this scale. 

Subjects, The Ss for Task 1 consisted of 35 high- 
self-esteem (HSE) Ss and 36 low-self-esteem (LSE) 
Ss, defined as stated above, while the n’s for Task 2 
were 30 and 55, respectively. (Fifty-one Ss were ad- 
ministered both tasks.) Finally, for Task 3, there 
were 25 HSEs and 42 LSEs, All Ss were introductory 
psychology students fulfilling course requirements. 


Results 


Table 3 summarizes the findings of this 
experiment, providing complete support for 
the hypothesis. There is a significant relation- 
ship between task success and task liking for 
HSE individuals, but no such relationship ex- 
ists for LSE individuals, with this holding 
true for all three tasks. 


Discussion 


The complete support for the hypothesis 
found in Table 3 appears to suggest in a 
fairly conclusive manner, at least for experi- 
mental laboratory tasks: that task achieve- 
ment is important for task satisfaction, but 
only if one is used to task achievement and 
sees it as “proper” and “appropriate” for one- 
self. 

However, the question now arises for the 
LSE individual as to what satisfies him, if 
task achievement does not? The task-liking 
and/or job-liking response occurs for him as 
well as for the HSE individual, but ap- 


parently the determinants are different. Ex- 
periment II was designed to provide a pre- 
liminary answer to this question with its 
major hypothesis being that task liking for 
the LSE individual is primarily a function 
of social influence processes. That is, it 
was predicted that, for LSE individuals, if 
others like the task they would, and if others 
did not they would not; the logic for this 


t 
hypothesis being that we tend to use others 
9 


TABLE 2 


RELATIONSHIPS BETWEEN GHISELLI SELF-ASSURANCE 
SCALE AND RANKING AND RATING VERSIONS OF 
THE STEIN SELF-DEscRIPTION INVENTORY 


Ranking M Rating M 
Need scale 1 2 
Highs | Lows Highs | Lows 
Abasement 16S || 16.2 | ‘ns 3.6 05 
Achievement 6.2 9.1 05 E 
Affiliation ns 15 
Aggression 1 i 


Autonomy 
Blamavoid 
Counteraction 
Defendence 
Deference 
Dominance 
Exhibition 
Harmavoid 
Infavoid 
Nurturence 
ler 


Play 


Bee 


N 
SS SNN 


ee 


Rejection 
Sentinence 


* 
Succorance 


FF 
SSS 
nenne 

pi 

2 
DD D WD N D pA Ga oa I a Ga O a a a N S G 
SS ROOIEN ARION 
SEED e 
ACAUFEEACHONWOCOCHRAU 

= 

= 


= 69 college 


Note.—Low mean indicates strong need; author i 


students. All significance tests are two-tailed. Ny he 
indebted to Charles Miller for these data. 
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TABLE 3 


LIKING For Tasks As A Function or Task 
Success AND SELP-ESTEEM 


Low self-esteem | 


Inferences 
Succeed 
Failure 

Recognition 
of assump- 
tions 
Succeed 


1. 
1. 


is 

8 
ee 
88 


3.80 
3.08 


0 13 


2.38⁰ 
62 14 
1.1 16 


oe 
wo 


.68 | 14 


3 
3 
5.07 
3 
1.211 3 


ne 
miv 
8 


Failure 


4 


1.20 


*p 8.05. 
Wp < 01. 


as a guide as to how we should evaluate vari- 
ous phenomena in the environment (including 


High self-esteem] 
Task — 137 
M | SD|N m | sp\N| 


ourselves) (cf. Deutsch & Gerard, 1955). It 
was also predicted that within each task 
there would be no relationship between task 
success and task liking. On the other hand, 
for the HSE individual, it was predicted that 
there would be such task-success—task-liking 
relationships for each task (i.e., high “others 
like“ and high “others do not like”). The 
l prediction for these latter individuals as to 
“social influence effects” between the two 
tasks was less clear, however. On the one 
hand, there seemed to be no reason to expect 
a lack of a “social influence” effect for HSE. 
Yet, it also appeared logical that the task- 
success-task-liking relationship might mitigate 
this effect somehow. For these reasons, no 


specific prediction was made as to how much 
“social influence” might affect the HSE indi- 
vidual liking for a task, except that it was 
thought that the effect would be somewhat 


less than for the LSE Ss. 


EXPERIMENT II 
Method 


The task involved in this experiment was the same 
anagrams task used in the first experiment, that is, 
12 words from the anagrams list of Tresselt and 


Mayzner (1966), In addition, all other procedures 
were exactly the same with the sole exception that, 


in this case, after the task was completed and the 
individual was informed of his “success” or “failure, 


he was also told by E that most other students 


“did” 


like the task or “did not” like the task. After such 


information was presented, Ss 


were then asked to 
rate their own liking for the task on a 10-point scale. 


487 


As before, the measure of self-esteem used was 
the Ghiselli Self-Assurance scale. 

Subjects. All Ss were introductory psychology stu- 
dents fulfilling a course requirement. The breakdown 
by self-esteem scores resulted in 57 HSEs and 72 
LSEs, with the division point being the median on 
the national norms. 


Results 


The analysis consisted of an “unweighted 
means” ANOVA (Winer, 1962), analyzing 
each self-esteem group separately, the predic- 
tions being that there would be a pass-fail 
effect for the HSE individual and possibly a 
“social popularity” effect, whereas only the 
latter would be true for the LSEs. The results 
are summarized in Table 4 and indicate com- 
plete support for the moderator influence of 
self-esteem on the task-success—task-liking re- 
lationship. However, support for the differen- 
tial importance of social influence processes 
on liking for a task, when success on the task 
is controlled for, as a function of self-esteem, 
does not occur. In fact, the effect occurs more 
strongly for the HSE group than it does for 
the LSE, a result opposite to the prediction, 
but the effect is not significant for either. 


EXPERIMENT III 
Method 


This third experiment constituted a replication of 
the second experiment with the following exceptions: 

(a) The task used was the Inferences task from 
the Watson-Glaser Critical Thinking Appraisal; 

(b) In addition to E stating that other students 
liked or disliked the task, an accomplice ! was used 
to say that he liked or disliked it and to request of 
the E that he state his opinion. All three of these 
opinions were confirming of one another and thus, 
hopefully, provided a stronger “popularity” manipu- 
lation effect than in Experiment II (although a 
request for opinions after Experiment IT similar to 
the one we will describe below for Experiment III 
indicated that the “popularity” manipulation was 
believed by all Ss). 

(c) Most importantly, self-perceived competence 
for the task (a component of self-esteem as we have 
defined it) was manipulated by £ in the following 
manner. In Treatment A, Ss were presented with the 
task and told that this was the first time this norma- 
tive opinion study was being done at New York 
University. Prior to this time the test had been ad- 
ministered at Ivy League institutions which were 
highly selective in their admissions policies and it 
was from these individuals that we had our knowl- 


4 The author is indebted to James Hoon for his 
assistance in this experiment. 
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TABLE 4 


LIKING For Tasks as A Function or Task Success, SELF-ESTEEM, AND OTHERS’ 
LIKING FOR THE TASK FOR EXPERIMENT IT 


High self-esteem Low self-esteem 
Statis- | Succeed | Fail Succeed Fail 
tic : ie = zS 
| z Others do x Others do | Others do | Others do 
Others like not like | Others like not like Others like abe like Others like | not like 

M 7.13 5.83 4.93 3.58 5.95 5.38 6.27 | 5.00 
SD 2.2 3.1 2.7 | 2.7 2.1 2.4 2.3 2.8 

N 12 18 15 12 19 21 22 | 10 


Note.—High self-esteem succeed-fail F = 9.53, p < .01; low self-esteem succeed-fail F < 1; high self-esteem others like-do 


not like F = 3.65; low self-esteem others like-do not like F = 2.59 


edge as to what average performance and liking 
levels were. Now we (the researchers) were moving 
into more “mass” institutions where students were 
of a much wider range in quality and where the 
average student was considerably below the Ivy 
League student in ability. Hence, the researchers did 
not expect that they would get the same levels of 
performance here. 

In Treatment B, the same basic procedure was 
used except that Ss were told that previously the 
researchers had been working on junior and com- 
munity colleges and were now moving up to more 
“quality” institutions where they expected the level 
of performance to be higher. In both this treatment 
and in Treatment A, well-known examples were 
given of “high” and “low” quality institutions with 
which the Ss at the institution were being classified 
in order to solidify further the Es’ evaluation of 
their abilities. 

The effectiveness of the general experimental pro- 
cedure was checked by asking each S to write a 
paragraph indicating his reaction to the whole 
research situation in which he was participating. 
Of 179 Ss, only 1 S indicated that he did not be- 


lieve the normative purposes of the study, while 
only 4 did not believe the fact that the Es worked 
in the universities and/or colleges indicated previ- 
ously, These Ss were, of course, eliminated. Of the 
175 remaining, every paragraph written indicated 
that they took seriously the stated purposes of the 
study as indicated by the E. In addition, their com- 
ments indicated that the “popularity” manipulation 
Was accepted as well as in Experiment II. 

Subjects. All Ss were introductory psychology stu- 
dents fulfilling course requirements, with 94 being 
tun in the “high” condition and 80 in the “low.” 
These were further broken down, as in the second 
experiment, into high others-like-it and low others- 
like-it, as well as pass-fail, with these breakdowns 
being indicated in Table 5. 


Results 


The analysis performed here used the same 
procedures as in Experiment II, but with 
different results. There is an interaction effect 
for the high group indicating that achieve- 


TABLE 5 


LIKING FOR Tasks AS A FUNCTION OF Task Success, SELF-ESTEEM, AND OTHERS? 
LIKING FOR THE TASK FOR EXPERIMENT III 


High self-esteem Low self-esteem 
Statis- Succeed Fail Succeed Fail 
tic 
Others do ` Others do P Others do à Others do 
Others like not like | Others like not like Others like not like | Others like | not like 
M 7.53 4.10 5.82 4.58 6.96 3.80 6.19 4.35 
SD 2.0 1.9 2.7 1.7 1.8 2.0 1.4 2.6 
N 28 37 17 12 30 20 16 14 


Note.—High self-esteem succeed-fail F = 1.81; low self-esteem succeed-fai 


like F = 23.7, p < .01; low self-esteem others li 


ke-do not like F = 34.4, % < 01 


il F = 2,27; high self-esteem others like-do not 
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ment has an effect on satisfaction only for 
the condition where others “like” the task, 
but that there is no effect when others do 
“not like” it. There is an others like/dislike 
the task effect for both groups, but there is no 
pass-fail effect in either case for the low group. 


Discussion 


Taken in context, the results of these ex- 
periments are highly consistent in supporting 
the general hypotheses advanced in the intro- 
duction to this paper. It seems apparent that 
the meaning of satisfaction and liking varies 
as a function of the self-esteem variable and 
what induces liking for a phenomenon in one 
group will not induce liking in the other. We 
may consider then that the balance hypothe- 
sis advanced here has received support similar 
to that which it has received in other papers 
(cf. Korman, 1966), and that claims as to 
the great significance of task achievement for 
satisfaction are oversimplified at best. For 
some people, it is apparent, task achievement 
is not a source of satisfaction nor is task 
failure a particular source of dissatisfaction. 
To be more specific, task achievement to the 
LSE person is apparently irrelevant to his 
satisfactions. In addition, it also appears as 
though a task must meet at least some stan- 
dards of “social desirability” before achieve- 
ment on it will serve as any kind of satisfac- 
tion at all, People for whom task achievement 
is a source of satisfaction will not feel such 
satisfaction if the task is not worth achieving 
on. 
What, then, does determine the satisfied 
response of the LSE individual, assuming 
his satisfied response is a lawful response? 
Our hypothesis that social influence processes 
may be involyed in that perhaps the LSE 
person uses other individuals’ response as a 
guide to his own response in a given situation 
was supported under the “strong” manipula- 
tion, but not the “weak.” Since authority (E) 
influence was also involved as well as peer 
evaluation, we have little way of knowing 
whether it was the additive aspects of the 
influence in Experiment III which were im- 
portant or its qualitatively different aspects. 
(The fact that the “popularity” manipulation, 
as measured by postexperimental opinions, 
was equally accepted in both cases only adds 
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to the inconsistency of the findings.) Further 
testing of this hypothesis seems to be neces- 
sary, particularly when it might be moderated 
in its effects by the self-esteem variable, 
something which did not occur here. One pos- 
sibility which might be investigated is that 
the effects of self-esteem might be curvilinear, 
rather than linear, as recently suggested by 
Cox and Bauer (1964). 

What does seem to be clear, however, from 
these results, are the implications they sug- 
gest as to how to increase the feeling of satis- 
faction one gets from task achievement, Ap- 
parently what appears to be necessary is that 
one must first change the individual in order 
that he sees achievement as befitting him and 
being appropriate for him. It is only after 
such personal change has taken place that he 
will be satisfied with achievement on a de- 
sirable task and gain pleasant effect from it. 
How does one introduce this in the organiza- 
tional environment? There are various ways 
of varying one’s feelings of self-esteem and 
self-competence through interpersonal evalu- 
ation, varying control procedures and leader- 
ship behavior which seem to be quite com- 
mon in the organizational environment, as 
Argyris (1964) and Korman (see Footnote 
2), among others, have pointed out, and which 
would seem to be quite germane to this hy- 
pothesis. Hence, it would seem that a pro- 
gram of experimental research using these 
organizationally relevant variables would be 
of considerable value in further testing this 
hypothesis, particularly when task character- 
istics are also varied, a type of change al- 
ready of some interest and concern in the 
modern organization. 
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VOCATIONAL INTERESTS AND A MEASURE OF 
VOCATIONAL NEEDS * 
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The Minnesota Vocational Interest Inventory (a measure of the interests of 
nonprofessional men) and the Minnesota Importance Questionnaire (a measure 
of vocational needs) were administered to 505 male vocational rehabilitation 
clients. The group was split into 2 random samples of 246 and 259, and a 
canonical correlation analysis was developed on each group independently. 
Significant canonical correlations of .65, 56, and .52 were obtained for Group 
1, and .62 and .54 for Group 2. The first 2 sets of weights for each group 
were cross-validated on the other group. The ist set of canonical weights 
from each group yielded identical cross-validation coefficients of 38. Possible 
interpretations of the canonical variates and some theoretical and methodologi- 


cal considerations are discussed. 


The purpose of the present study was to 
test the hypothesis that measured vocational 
interests and vocational needs reflect a com- 
mon domain of psychometric behavior. Various 
attempts to relate interest measures to person- 
ality variables by zero-order or multiple- 
correlation methods have generally yielded 
low to moderate relationships. For example, 
Kohlan (1966) correlated the Strong Voca- 
tional Interest Blank (SVIB) and the Minne- 
sota Importance Questionnaire (MIQ), a 
measure of vocational needs (Weiss, Dawis, 
Lofquist, & England, 1966). In predicting 
each individual SVIB scale from the 20 scales 
of the MIQ, Kohlan obtained multiple Rs 
ranging from .28 to .62 with a median of 
42. The low magnitude of relationship found 
is typical of that appearing generally in the 
literature, leading Thorndike, Weiss, and 
Dawis (1968) to argue for the use of canoni- 
cal correlation for such multivariate prob- 
lems. In reanalyzing Koblan's data, Thorn- 
dike et al. (1968) obtained a maximum 
canonical correlation coefficient of .78 be- 
tween the 45 SVIB occupational scales and 
the 20 MIQ scales. This coefficient accounted 
for 62% of the variance of the canonical 
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variates. A replication study with these in- 
struments yielded similar results. 

The present study presents results on the 
canonical correlation of a different measure 
of vocational interests, the Minnesota Voca- 
tional Interest Inventory (MVII), with the 
same measure of vocational needs, the MIQ. A 
secondary objective of the present study was 
to determine the reliability of the obtained 
canonical relationships by subjecting the 
canonical regression weights to cross-valida- 
tion. 


METHOD 


The Ss for this study were 505 male applicants for 
services at the Minnesota Division of Vocational Re- 
habilitation. All Ss completed both instruments in one 
testing session as part of a larger battery of tests 
given to assist in vocational counseling. The Ss were 
administered the instruments in small groups 
throughout 1966 as part of the continuing research 
program of the Work Adjustment Project at the 
University of Minnesota. 

After the data had been collected, the answer 
sheets were ordered according to a six-digit identifi- 
cation number and a table of random numbers was 
used to generate two groups of 246 and 259, respec- 
tively. Tests for differences of scale means and vari- 
ances showed that the two groups did not differ in 
these respects. Possible effects of age, education, and 
marital status were also effectively minimized by the 
randomization procedure. 

The data from the two samples were then sub- 
jected to independent analysis using the technique 
of canonical correlation, a correlational method 
which determines the maximum relationship between 
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two weighted liacar composites (Cooley & Lohnes, 
1962; Hotelling, 1916). The 21 MVII empirical scales 
occupational keys) were treated as one set of varis- 
bles and the 20 MIQ scales as the other. Due to 
the multiple and simultaneous determination of the 
regression weights in either set by the variables in 
the other set, the predictor-criterion distinction of 
the multiple correlation technique does not apply, 
and scores on either set of variables can be pre- 
dicted from those of the other set. The computer 
program used (Cooley & Lohnes, 1962) calculates 
the canonical regression weights for the larger set of 
variables, then the weights for the smaller set of 
variables for each successive canonical correlation in 
turn. The canonical variates may be thought of as 
“factors” extracted from each set of variables such 
that the correlation between the “factor scores” (the 
canonical correlation) will be a maximum for each 
successive pair of variates, The magnitude of the 
correlation coefficient indicates the degree to which 
these “factor scores” (weighted linear composites of 
the original “predictor” and “criterion” variables) 
covary with each other or can be predicted from 
each other. 

After the sets of canonical regression weights had 
been determined for each group, they were applied 
to the data for the other group in a double cross- 
validation design. Predicted total scores on each 
instrument were computed, based on the beta 
weights previously obtained from the other sample, 
and the product-moment correlation between these 
predicted total scores was obtained for each group. 

8 fad a canonical correlation is 
a to of multiple correlation, ex that 
the “actual criterion score” is the predicted salad 
score of the “criterion” set. 


REsutts 


The canonical correlations resulting from 
the development equations are shown in Table 
1. For Group 1, three sets of weights yielded 
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statistically significant canonical correlation 
coefficients of .65, .56, and .52, accounting for 
43%, 31%, and 27% of the variance of the 
canonical variates, respectively. The data for 
Group 2 yielded two significant canonical cor- 
relation coefficients of .62 and .54 which ac. 
counted for 38% and 30% of the variance of 
these canonical variates. 

Tables 2 and 3 list the scales scored on each 
instrument and the beta weights for the sig- 
nificant canonical correlations, rounded off to 
two decimal places, In using the weights for 
cross-validation, five decimal places were re- 
tained. It should be noted that the weights 
for the first canonical correlations of the two 
groups are more similar than the weights for 
any two successive canonical correlations for 
the same sample. This is to be expected due 
to the fact that succeeding pairs of canonical 
variates for a given sample must be orthogonal 
to all preceding pairs. 

Table 2 shows that, for the first canonical 
correlation for Group 1, the variate defined 
by the MVII scales Milk Wagon Driver, 
Printer, Truck Driver, and Plumber was re- 
lated to the variate defined by the MIQ Re- 
sponsibility scale (positive weight) and by 
the MIQ Activity and Independence scales 
(negative weights), Table 3 shows that, for 
the first canonical correlation for Group 2, 
the variate defined by the Milk Wagon Driver, 
Stock Clerk, Printer, and Truck Mechanic 
scales of the MVII was related to the variate 
defined by the Responsibility and Supervision- 


TABLE 1 


CANONICAL CORRELATION, WILKS’ 


LAMBDA, VALUE OF THE CHI-SQUARE STATISTIC, AND SIGNIFICANCE 


TESTS OF THE FIRST SEVERAL ROOTS OF THE INTERCORRELATION MATRICES RY Group 


Root 
. Es es PP 
1 
Group 1° 4 
4 
1 
Group 24 2 
3 


be associ ated latent root, Mi = 
eN = 246, 
aN = 259, 


df z 
s u 

420 <.001 
380 <.005 
342 <.05 
306 >.05 
420 <.001 
380 <.05 
342 >.05 


The t, Res. 
Probability of error jn rejecting the null hypothesis of no significant difference from zero canonical correlation. 
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TABLE ? 
Caxoxicat Reosesson Wears ros tee Tusze Soxtrocaxwr 


MIQ weights 


Seale | Ru = 68| Ra = $6 | Ra = 52 


Ability Utilization | 05 » | -10 
Achievement -16 =n | Sgt 
Activity | =35 — 05 — 
Advancement a — 0 | 17 
Authority 10 — 1 | M 
Company Policies 
and 7 — 04 AS 
Compensation oT = OS — 08 
Co-workers os 6 | M 
Creativity 2» 3 ES] 
-46 3 | -2 
Moral Values 06 mM -28 
Recognition 14 06 — 16 
Responsibility 35 -9 | -9 
Security -3 — 26 » 
Truc Social Service n o 00 
Industrial Education Social Status 06 17 -M 
Teacher d 90 | Supervision—Hu- 
Sheet Metal Worker è t 10 man Relations 0 0⁰ -09 
Plumber i —.24 | Supervision- 
Machinist ` i 04 | Technical — 02 -23 — 8 
Hlectrician j —.39 | Variety 20 18 » 
Radio-TV Repairman a © | Working Conditions | —.16 10 D 


and the Independence scale (negative weight) 
of the MIQ. Scales receiving large weights on 
the first pair of variates were found in com- 
mon for each instrument and for both groups: 
Milk Wagon Driver and Printer (both re- 
ceived positive weights) for the MVII and 
Responsibility (positive) and Independence 
(negative) for the MIQ. Comparison of the 
= weights for the second canonical correlation 
does not reveal the same similarity between 
A groups. 
Buy emphasizing those scales which show 
_ fairly reliable and large weights across the 
two samples an attempt can be made to in- 
terpret the canonical vectors. On the MVIT 


whose interest response patterns are similar 
to those of Printers and Milk Wagon Drivers 


ate occupational groups would be necessary 
before such an interpretation could be ac- 
cepted as having been empirically demon- 
strated. 

The weights which define the vectors of the 
second canonical correlation are so different 
for the two randomly developed groups that 
an interpretation of the above type seems un- 
warranted. 

Table 4 contains the correlations obtained 
when the weights developed on Group 2 are 
applied to the data from Group 1 and vice 
versa. The first set of weights developed on 
Group 2 yielded a statistically significant 
correlation of .38 when applied to Group 1, 
while the second set of weights from Group 2 
did not yield significant results. When the 
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TABLE 3 


CANONICAL WEIGHTS FOR THE Two SIGNIFICANT CANONICAL CORRELATIONS FROM Group 2 


MVII weights 


MIQ weights 


Scale Ra = .62 | Reg = .54 Scale Ra = .62 | Ra = .54 

Baker 07 —.11 [Ability Utilization .20 01 
Food Service Manager —.72 32 | Achievement 23 — 23 
Milk Wagon Driver 1.07 14 | Activity —.15 44 
Retail Sales Clerk 68 76 | Advancement .09 10 
Stock Clerk 1.01 —.69 Authority 19 —.09 
Printer 1.68 22 | Company Policies and Practices 06 —.03 
Tab Machine Operator —.10 29 | Compensation —.05 —.12 
Warehouseman —.22 56 | Co-workers 17 = AS 
Hospital Attendant 42 63 | Creativity —.03 —.21 
Pressman = 103 84 | Independence —.46 —.2 
Carpenter -17 52 | Moral Values —.03 02 
Painter 43 07 | Recognition .06 .33 
Plasterer .09 —.29 | Responsibility 56 —.04 
Truck Driver 68 30 | Security —.14 06 
Truck Mechanic 1,29 —.67 Social Service 02 34 
Industrial Education Teacher 80 —.99 Social Status —.04 — 2 
Sheet Metal Worker 25 34 Supervision Human Relations 37 45 
Plumber 06 -79 | Supervision—Technical —.13 —.34 
Machinist 46 -68 | Variety 04 12 
Electrician 56 1.75 Working Conditions —.32 —.05 
Radio-TV Repairman 38 — 1.43 

first two sets of weights developed on Group canonical equations, which involved 41 


1 were applied to Group 2, the first set yielded 
a correlation of .38 and the second a correla- 
tion of .28. Both of these correlations are 
Statistically significant at p < .01. 


Discussion 


The fact that significant correlations of 38 
were found in both of the cross-validation 


TABLE 4 


Cross-VALIDATION CORRELATION COEFFICIENTS. 
F TESTS, AND SIGNIFICANCE LEVEL FOR 
THE Frrst Two SETS Or WEIGHTS 
FROM Eacir Group 


Set r F p 
Group 2 weights applied to 
Group 1 
First canonical weights -38 | 41.7 <.001 
Second canonical weights —.05 762 >.05 
Group 1 weights applied to 
Group 2 
First canonical weights 38 | 42.8 <.001 
Second canonical weights 28 21.5 2.001 


a Probability of error in rejecting null hypothesis of no dif- 
ference from zero correlation, 


weights each, is noteworthy. These results 
Suggest a stable multivariate relationship be- 
tween measured vocational interests and mea- 
sured vocational needs. 

The authors attempted an interpretation of 
the first set of canonical weights, but only in 
the presence of cross-validation evidence 
which very strongly indicated that such an 
interpretation was justified. Other authors 
(e.g, Dunteman & Bailey, 1967) have in- 
terpreted canonical variates in cases where 
the evidence was less strong. Although those 
authors obtained two slightly larger correla- 
tions (69 and 61) for the first two pairs of 
vectors, their sample was smaller and they 
reported neither a replication nor a cross- 
validation of their results, The present study 
shows that cross-validation or replication 1s 
required before any attempt at interpretation 
of canonical variates can be made. 

The fact that a correlation of .54 became 
—.05 when the development weights were 
applied to a second random sample from the 
Same population is suggestive of the amount 
of sample-specific variance that can be in- 
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volved in the calculation of canonical corre- 
lations. An interpretation of the second set of 
weights from Group 2 would have been mean- 
ingless for Group 1. It would therefore seem 
advisable to put little confidence in the in- 
terpretation of a pair of canonical variates 
developed from a single sample unsupported 
by additional evidence in the form of either 
a replication which produces very similar 
results or a cross-validation. Unfortunately, 
such interpretation is commonly included in 
the meager but growing number of research 
reports using canonical correlation techniques. 

The results of this study lend support to 
an important trend in research methodology. 
Various authors, particularly Dunnette 
(1963), have argued that a single criterion 
measure is not adequate in the applied pre- 
diction situation because what is being pre- 
dicted is very seldom a unidimensional at- 
tribute which can be adequately expressed as 
a single criterion variable. The use of a priori 
weights in a composite criterion is not very 
much better because the investigator seldom 
has a very accurate knowledge of what it is 
in the composite that he is in fact predicting. 
However, this less than desirable methodology 
has continued in common use because of the 
restrictions in the correlational techniques 
which have been used. 

One of the weaknesses which is often men- 
tioned in using the canonical correlation tech- 
nique in an applied context is that the “cri- 
terion” weights developed to satisfy a mathe- 
matical least-squares solution may not reflect 
the values of the variables expected by the 
researcher. In this regard, it should be under- 
stood that canonical correlation analysis is 
primarily a research technique, the purpose 
of which is to show the investigator what his 
data imply. Canonical correlation analysis 
identifies the “criterion” variables which the 
“predictor” battery predicts and their rela- 
tive importance from a linear-weighted com- 
Posite point of view. If the “criterion” varia- 
bles were designed to be used as part of a 
composite criterion, the canonical weights give 
the best regression weights for maximum pre- 
diction of that linear-weighted composite. 
Divergence of these canonical regression 
weights from the researcher’s a priori weights 
should result in a reexamination and modifi- 


495 


cation of the predictor battery so that the em- 
pirical and a priori weights become less dis- 
crepant, or in a modification of the research- 
er’s a priori weighting system. 

Although the method of canonical correla- 
tion has been known for over 30 years, the 
difficulties of computation have caused it to 
be generally disregarded until quite recently. 
This method, although still a tool for research 
rather than practical prediction, allows the 
applied psychologist to explore the variables 
of interest to him in the complex multivariate 
manner in which they occur. 

In the context of the present substantative 
question, that of the relationship between 
measured vocational interests and measured 
vocational needs, canonical correlation has 
served to uncover a relationship hidden by 
other techniques. The present study, in con- 
junction with the previous study using the 
SVIB (Thorndike et al., 1968), suggests a 
moderate (cross-validated R.  .38) relation- 
ship between vocational needs and the mea- 
sured interests of nonprofessional men, and 
probably a relationship of similar magnitude 
(un-cross-validated R. .75) between mea- 
sured vocational needs and the measured in- 
terests of professional men. Further, it has 
been shown that these relationships do not 
exclusively reflect the operation of chance 
factors in that cross-sample stability exists 
for the magnitude of the relationship and, 
in the present study, that the pattern of 
weights defining the relationship yields sta- 
tistically significant results when applied to 
another sample from the same population. 
These data, therefore, suggest that vocational 
interests and vocational needs reflect the op- 
eration of similar underlying variables. 
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A SET OF BASIC INTEREST SCALES FOR THE STRONG 


VOCATIONAL INTEREST BLANK FOR MEN! 


DAVID P. CAMPBELL, FRED H. BORGEN, SUZANNE H. EASTES, 
CHARLES B. JOHANSSON, Ab ROBERT A. PETERSON 


University of Minnesota 


22 homogeneous content scales were developed for the Strong Vocational Interest 
Blank (SVIB) by clustering items with high intercorrelations. The resulting scales 
were named: Adventure, Agriculture, Art, Business Management, Law/Politics, 
Mathematics, Mechanical, Medical Service, Merchandising, Military Activities, 
Music, Nature, Office Practices, Public Speaking, Recreational Leadership, Reli- 
gious Activities, Sales, Science, Social Service, Teaching, Technical Supervision, 
and Writing. Mean scores are presented on each of these scales for 113 occupational 
samples, and test-retest statistics are included for groups tested and retested over 2 
wk., 30 days, 3 yr., 22 yr., and 30 yr. Similar information is presented for several 
student samples including a group who were tested as freshmen, sophomores, Syr., 
and 15 yr. after graduation. The main purpose of these scales is to provide some in- 
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terpretation aids for the traditional occupational scales of the SVIB. 


The regular scales now in use for the Strong 
Vocational Interest Blank (SVIB) were de- 
veloped by comparing the responses of men in 
Specific occupations with those of a men-in- 
general (MIG) group. Those items selected 
more often by the occupational sample serve 
as a scale for identifying the characteristic 
interests of the men in that occupation. These 
are usually termed “empirical” scales, and the 
Scores provide an index of similarity between 


the individual’s interests and the interests of. 


men in the specified occupations. A substantial 
amount of research (see, e.g., Campbell, 1966; 
Darley & Hagenah, 1955; Strong, 1943, 1955) 


The statistics reported in this paper were expensive 
to accumulate, and many agencies provided assistance. 
he overall project was funded by the National Insti- 
tutes of Health through Grant 1428-03, most of the 
Computer time was furnished by the University Com- 
buter Center, University of Minnesota, and considerable 
clerical help was supplied by the Student Counseling 
ureau, University of Minnesota. 
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has shown that these scores are useful in psy- 
chological assessment projects and in counsel- 
ing situations. 


Problems with the Empirical SVIB Scales 


The major advantage of these empirical 
scales is that they include all of the discrimi- 
nating items for each occupation in one scale; 
this makes interpretation for that one occu- 
pation fairly easy. With these scores, a counse- 
lor can say something like, Vou have interests 
similar to lawyers.” However, the nature of 
empirical scales makes further psychological 
interpretation difficult. If the individual asks 
the obvious, “What does it mean to have 
interests similar to lawyers?” the counselor 
must fall back on what he has learned about 
lawyers from other sources. Although the re- 
lated scales on the SVIB profile provide some 
general flavor, interpretation is still shallow 
except when done by well-trained sophisti- 


0. 
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TABLE 1 
SAMPLE SVIB ITEM INTERCORRELATIONS 
% 500 Men In General 
It 
= Like | Indif-| Dis- | Total 
ferent | like 
Algebra 
Arithmetic 
Like 51 12 06 69 
Indifferent 03 10 06 19 63 
Dislike 0¹ 01 10 12 
Total 55 23 22 100 
Geometry 
Chess 
Like 16 04 03 23 
Indifferent 24 12 06 42 21 
Dislike 15 11 09 35 
Total 55 27 18 | 100 
Actor 
Listening to a 
story vs. 
Telling a 
story 
Like 09 19 24 52 
Indifferent 06 10 16 32 | —.19 
Dislike 08 04 04 16 
Total 23 33 44 | 100 


cated counselors who have had considerable 
experience in working with the SVIB and who 
have studied the research literature, 

A second major disadvantage is that there 
is no limit to the number of empirical scales; 
in theory one could be developed for each occu- 
pation—but there are over 20,000 occupations 
listed in the Dictionary of Occupational Titles. 
While one can, in only partial fantasy, visualize 
a computerized system where this information 
would be available for hundreds of occupations, 
this is still far beyond current capabilities, In 
recent practice, to compensate for this rela- 
tively narrow coverage of the occupational 
world, the counselor usually extrapolates from 
the available SVIB scales to other occupations 
not listed on the profile. For example, if a 
student inquires about geology, scores on the 
Engineer and Chemist scales are probably 
relevant. Some,such extrapolation will be in- 


evitable, no matter what the final method used, 
but it should be as easy and direct as possible. 

A third disadvantage of these empirical 
scales is that they are difficult to work with in 
research studies. If an investigator wishes to 
study men who have survived in a specific 
occupational setting versus those who have 
not, he finds it cumbersome to compare these 
two groups on all of the 60 current empirical 
scales. While he can, and usually does, calcu- 
late the mean differences on each of the scales, 
the resulting statistics do not offer either a 
parsimonious nor easily interpretable method 
of understanding what distinguishes between 
the interests of these two samples. 

The empirical success of the SVIB has 
created most of these problems. Because the 
scales are useful and do provide an easily inter- 
pretable score for a specific occupation, there 
has been considerable pressure to build more 
of them, further increasing the complexity of 
the profile. From all indications, this will con- 
tinue as more occupations feel the need for 
self-study. This is good, for this detailed 
psychometric information on a wide variety of 
Jobs is essential, but this prospect makes it 
even more imperative to develop a simpler way 
to summarize the results. 

What both the counselor and researcher need 
is another system of scoring to supplement the 
occupational scales, a system containing rele- 
tively few scales, but scales which could be 
used to generalize beyond a single occupation. 

Clark (1961), in his research with the Minne 
sota Vocational Interest Inventory, has show? 
that one way to do this is to work with mea 
sures that are homogeneous in content. Each 
of his homogeneous scales (or area scales 48 
they are called on the profile) reflects interests 
in one type of activity or in closely relate 
activities; thus, they are “pure” content scales. 

Such a set of scales has been constructed for 
the men’s form of the SVIB; this monogra 
is a report of their development and of the! 
psychometric characteristics. Because this Na? 
an attempt to identify the important cue 
of interests represented in the SVIB, thes 
scales have been termed “basic interest 
scales, 


Basic Scare DEVELOPMENT 


11 a ; a 
To build these scales, an item intercorte 
tion matrix was generated for the SVIB ane, 
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TABLE 2 
ADVENTURE CLUSTER 
Item 211 „ | asi | 12 | sm | so | ar |as | a 

211 Pursuing bandits in a sheriff’s posse 24 26 37 23 H 

14 Airplane pilot 25 27 37 16 22 
231 Climbing along edge of precipice 23 12 25 19 10 

12 Auto racer 28 40 11 22 32 
334 Taking a chance 13 05 16 17 11 

86 Secret Service man 42 29 15 30 14 

277 Men who live dangerously* 21 22 20 25 16 27 
337 Thrilling, dangerous activities* 36 18 26 28 62 25 31 
321 Airline pilot 18 51 13 23 12 18 22 30 


Note. Cortelations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s, Decimals omitte 
s New item from the 


from this matrix, clusters of items with high 
intercorrelations were identified. The items in 
these clusters became the basic scales. 


Set 1 and Set 2 of the Basic Scales 


Actually, two sets of scales were constructed, 
the first using the 291 items common to all 
editions (1927, 1933, 1938, and 1966) of the 
SVIB booklet, the second using all 399 items 
of the 1966 revised form. 

Although the first set will usually not be 
used in current projects either for counseling 
or research, three factors dictated its develop- 
ment: first, practically all of the SVIB cri- 
terion groups were studied with either the 
1927, 1933, or 1938 booklets; second, many 
longitudinal studies have old data on an 
earlier booklet and current data on a later 
form; third, a more practical concern despite 


1966 booklet, added to develop the Set 2 scales. See text. 


the efforts of the publisher and the scoring 
services, there inevitably are users who have 
not yet switched to the revised (1966) form. 
For all of these situations, a set of scales com- 
patible with both early and recent booklets is 
needed. 

The second set of basic scales is simply the 
first set expanded to include the items added 
to the SVIB in the 1966 revision. In this re- 
vision, a definite attempt was made to improve 
the coverage in areas relatively neglected in 
the original item pool such as art, religion, and 
music. As a result, scales for those areas are 
longer and more adequate in the second set. 
Unless confronted by one of the situations in 
the preceding paragraph, current users of the 
SVIB should use this second set. 

Because most of the normative data pre- 
sented here are from groups tested before 1966, 


TABLE 3 
AGRICULTURE CLUSTER 


Item 37 | 102 | 342 | 345 

37 Farmer 65 24 22 
102 Agriculture 58 27 28 
342 Outside work 21 27 40 
345 Physical activity 20 19 40 

76 Rancher 67 54 26 18 
191 Handling horses 28 28 20 16 
230 Living in the city| 24 —17 —24 —18 
162 Horseback riding 17 20 13 11 
214 Forest ranger* 34 37 23 18 
150 Camping oute 18 20 20 13 


| 76 191 | 230 | 162 | 214 | 150 
64 28 —34 
44 32 —18 
30 26 —25 
18 25 —10 
29 —30 
35 10 
—21 —06 
27 71 —01 
37 24 219 21 
18 23 — 13 24 39 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 
Item considered for inclusion in cluster, 


but discarded for various reasons. See text. 
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Fic. 1. Frequency distribution of item intercorrelations. 


only the Set 1 scales are used unless specifi- 
cally noted otherwise. 


Eliminaled Items 


Two restrictions were applied to the indi- 
vidual items before scale construction began. 
First, to eliminate the highly popular and un- 
popular ones, no item was used if any of the re- 


sponses, Like (L), Indifferent (I), or Dislike 
(D), had less than 15% response among à 
sample of MIG. Second, none of the items 
numbered 281-320 were included; those are 
the items grouped in tens where the respondent 
is to select the three he likes best, the three he 
likes least, and mark the remaining four in- 
different. This format is troublesome as 1 ot 


TABLE 4 
Art CLUSTER 
Item 5 | 20 | 43 | 10a | 13 | 104] 238 | 84 | 141 39 | 1a | 53 | 268 8 
5 Artist e 30 56 
20 Cartoonist 54 2 40 
43 Interior decorator 38 26 38 26 42 35 46 
104 Art 65 46 38 50 38 30 38 
163 Art galleries e deme 50) 3 
194 Decorating with flowers SEL PAE sei 36 39 33 
233 Looking at antique furniture. 30 15 26 33 42 39 33 
84 Sculptor Oy ES , , Sik lal a) 
141 Sketching pictures of wild 
animals* 45 3 27 46° 358 29 42 
39 Art museum director 48 38 39 48 50 26 31 59 38 
183 Magazines about art and 
musica 44 29 29 4 35 48 37 46 
253 Artistic men* 50 31 39 49 40 33 30 45 34 39 40 
263 Prominent artists 44 31 20 3 4% 27 36 44 33 44 50 56 
352 Music and art events vs. 
athletic events“ 35 21 19 33 42 15 18 38 26 41 50 22 34 
i E 


4 £ = Pe ated in 
Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


* New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
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33 Employment man- 


ager H 
35 Factory manager 47 
58 Manufacturer 28 54 
65 Office manager 57 48 
81 Sales manager 47 42 
206 Meeting and direct- 
ing people 33% 2 
220 Developing business 
systems 30 34 
347 Technical vs. super- 
visory responsibility | 30 —22 
196 Interviewing men 
for a job SM) ee 
42 Hotel manager 1 
85 Manager, Chamber 
of Commerce 48 34 
96 Travel bureau man- 
ager® 41 27 
177 Business methods 
magazines“ 34 37 
278 Prominent business- 
men* 28 31 


TABLE 5 
Bustness MANAGEMENT CLUSTER 


Item 33 | 35 | 38 | 6s st | 206 | 226 | s47 | 196 | 42 | as 96 | 177 | 278 


27 44 36 30 2 —15 4 J 34 
49 41 34 32 322-10 29 28 28 
35 38 25 38 -ii 27 18 R 
34 50 33 36 —24 28 40 43 
37 42 36 42 —24 26 34 Uu 
18 28 36 36 —17 39 31 2% 
34 39 41 29 —22 33 19 20 
—17 —28 37 —28 —26 —17 —18 —09 
21 40 39 45 33 —27 22 22 
22 38 38 31 20 —30 30 35 
23 45 42 32 31 -2 30 38 
25) 35) 32) ee eee 43... 39 
37 33 45 24 48 —28 34 21 31 
33 30 35 29 30 —24 29 23 26 


21 


22 37 


Note. Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


New item from the 1966 booklet, added to develop the Set 2 scales, See text. 


2% of the respondents fill in this section in- 
correctly. Another 1% or so, aggravated by 
this forced-choice format, write some hostile 
comment in the margin and refuse to fill in any 
of the items. To solve these problems, and 


in these basic scales. Only a 
affected by these restrictions. 


because these items are less efficient than the 


TABLE 6 
Law/Pourtics CLUSTER 


Item 


regular L-I-D items, this section will probably 
be removed in some future revision. In antici- 
pation of that move, these items were not used 


few items were 


41 | 47 as | st 2 | 72 10% | 399 | 323 | as6 | so 


41 Governor of a state 

47 Judge 

48 Labor arbitrator 

51 Lawyer, criminal 

52 Lawyer, corporation 

72 Politician 

160 Electioneering for offices 
359 Dog trainer vs. parole officer* 
323 Taxi driver vs. policeman? 
350 Warden of a prison? 

59 High school principal” 


51 42 31 385 51 

55 41 38 48 34 

39 48 33 22 39 

45 50 35 42 36 

34 46 30 82 22 

62 48 44 43 34 

50 33 „„ „ 28 87 
—25 —23 —26 —23 —12 24 18 
1 % Sell 23813) — 1, 13 17 
01 —00 01 —06 03 —03 —09 10 

38 hdd os ee Zo 290 37 26 —24 


10 
—12 04 


ote.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based 


1960s, Decimals omitted. 
New item from the 1966 booklet, 
Ttem considered for inclusion in cluster, 


added to develop the Set 2 


but discarded for 


scales. See text. 
various reasons. See text, 


on sample tested in 


MATHEMATICS CLUSTER 


TABLE 7 
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101 Algebra 


128 Physics 


Item 101 | 103 | 107 | 115 120 | 128 | 148 | 122 | 24 
63 61 67 75 36 
103 Arithmetic 65 48 51 79 32 
107 Calculus 62 47 56 59 44 
115 Geometry 67 50 62 65 50 
120 Mathematics 76 77 63 66 42 
50 38 55 53 48 
148 Solve mechanical puzzles*| 35 35 35 34 37 30 
122 Mechanical drawing* 26 31 31 35 36 35 41 
24 Civil engineer- 26 27 35 34 32 31 26 39 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


* Item considered for inclusion in cluster, but discarded for various reasons. See text. 


TABLE 8.—ITEM INTERCORRELATION MATRIX 


Item 19 | 56 | 94 | 16 | 189 | 190 | 122 | 132 | 148 | 187 | 121 | 185 | 188 
19 Carpenter 51 47 40 6 40 39 #55 26 35 46 36 36 
56 Machinist 47 66 47 44 62 41 55 34 55 40 48 48 
94 Toolmaker 49 73 e 37 2 õ36 335 ag 
186 Repairing a clock 35 45 40 48 48 39 42 44 58 36 60 60 
189 Cabinetmaking 57 43 49 48 e 51 40n 
190 Operating machinery e 39 49 43 60 42 45 56 
122 Mechanical drawing 32 41 42 38 44 40 52 41 35 61 38 39 
132 Shop work e (Se il6l 057 34 44 64 42 44 
148 Solving mechanical puzzles 23 33 35 40 31 37 33 30 30. 32 38ER 
187 Adjusting a carburetor F 45 9 (62) 3310059 30 37 63 9 
121 Industrial arts r 5 25 31 37 37 
185 Ilaking a radio or Hi Fiset 20 38 30 55 45 43 35 44 39 52 32 64 
188 Repairing electrical wiring 32 45 47 62 ee e 6 30 -53 
368 Have mechanical ingenuity] 27 39 e? 36 A5. 34 44 4 
13 Auto mechanic 47 59 55 46 36 53 29 47 32 62 32 36 43 
32 Electrical engineer 28 47 48 
45 Inventor 2 30 32 30 33 34 19 26 31 30 21 36 32 
55 Locomotive engineer s 34 18 34 28 20 28 
61 Mining superintendent e 30) D225 29... 27 24 20) 928 
30 Draftsman O metas ag1( G0 0 41. y 33 4i 28 A 
88 Shop foreman 33 52 54 2 2 42 33 42 25 40 34 22 31 
24 Civil engineer t 34 031 32 31 31 20 
180 Popular mechanics maga- 
zines Ye a CC 0 42 42 46 
98 Watchmaker J en sy begs) 36 26 30 0 
128 Physics 17 24 2 8 2 22 19 36 26 17 31 2 
16 Designer, electronic equip- 
mente 26 3 0 . 29 ᷣ27 39 34 26 48 46 
22 Electronics technician® 29) 4 4 36 3 9 4 
216 Looking at things in a O a EE O 
hardware stores 35 34° 36 0 37 30 20 28 36 
34 Geologist? 30 24 26 20 27 18 T 19 a 10 10 25 f 
87 Computer operator? „ eee oa 0 1 214 2 00. 1 
166 Science fiction magazines“ 09 21 16 % if 36 19 12-08, A. 15. 25 19 
Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


New item from the 1966 booklet, added to d 
b Item considered for inclusion in cluster, 


levelop the Set 
but discarded for v; 


2 scales. See text. 
‘arious reasons. See text, 
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Item Intercorrelations 


The next step in scale construction was to 
generate an item intercorrelation matrix, Using 
the responses from Strong’s group of 500 MIG, 
a Pearson product-moment correlation was 
calculated between each pair of items by as- 
signing the values +1, 0, and —1, respec- 
tively, to the Like, Indifferent, and Dislike re- 
sponses, then treating one item as the x-axis, 
the other as the y-axis. Examples of three such 
correlations are shown in Table 1, including a 
high-positive, a low-positive, and a high- 
negative correlation. As there were almost no 
large negative correlations, except between 
forced-choice items where they were artificially 
created, —.19 qualifies here as “high.” 


rok MECHANICAL CLUSTER 


33 49 28 DAAA Ssi sara mise 
39 63 45° C37 ESANA Sono nsO 
35 54 39), „ 40a ou a0 meer mn umes 
43 40 35. 339) G0, gue Gar | OU zo 
41° 36 29 % pais 400 conn isd 
„ 47° % éç— Sym Seon ee od 
30 28 31 2 
44 42 37 33 d 36. 1 1 
36 27 % "os aes gees om 
40 54 34% Vo ar ay eo ace 
42 35 24 2% 22m 2A m A0 RIN S 
43 41 42 36 30 29 30 24 32 
41 38 30 2 % sre om mes 
26 28 4i 25m 7am Foo ma 20 
35 36. 28 45) 3 27) Aon 28 
33 40 41 40 41 34 42 56 
41 25 40 30% 203 33i 25 
13 42 2 3% 1 23 
18 29 30 25 23 36 50 40 
27 33 % Pee . 
3 41 34 i l 4a 33 29 
30 34 57 34 290 36 39 30 
42 38 4 M OS meee Omen: 
3 4% „ 237 RP Stee ee? 
28 20 37 33 14 Ser 1a SS 
34 37 6 % % come cee 26— 00 
20 40 68 3% % OR eR AO Se 
26 20 2% i igs Merc tee 10m 22 25 
18 24 % 
04 24 32 13 24 % 1 22 
21 ig % op uO mene Oe oe Omens 


The next step was to decide exactly how to 
build the scales, While the authors wanted to 
identify clusters of items with high“ inter- 
correlations, they initially had little idea of 
what to call high. To gain some knowledge 
here, a frequency distribution, using all of the 
correlations in the item intercorrelation matrix, 
was constructed; it is reproduced in Figure 1. 
To provide a replication, an analogous dis- 
tribution is included for the item intercorrela- 
tions from the women’s form of the SVIB. The 
two distributions are remarkably similar. 

Several conclusions can be drawn from the 
distributions in Figure 1: 

1. The item intercorrelations were generally 
low, mostly around 0, practically all between 
+.20. 


24 37 30 
2 45 20 
18. 88 88 
28 39 28 
31 30 34 
35" OF) Sige 
32 2 35 
35 28 38 
34 29 30 
36 290 24 
ga; <20 nay 
33 34 24 
3 28 24 
32 81 
26 42 27 
a S 
3 
24 35 25 
17 34 28 
24 34 27 
7 
14 24 31 
15 21 
27 24 
24 19 
30 33 41 
35 36 35 68 
4 is 48 20 23 
F 
i Ob ae e 2 ae, 
33 of 17 26 2 14 15. 22 


1960s. Decimals omitted. 
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TABLE 9 
MEDICAL SERVICE CLUSTER 


Ttem 29 | 67 | 69 | 93 | 130 | 136 | 192 | 256 | 264 | 49 | 356 | 6t 

29 Dentist r 2 

67 Pharmacist 42 0 21 

69 Physician 38 32 823 36 

93 Surgeon 42 32 76 27 24 38 
130 Physiology Di 268 39) 33 34 238 

136 Zoology 171 2 48 15 

192 Giving first aid assistance 49 46 47 46 41 38 
256 Physically sick people* ei is 1584 

264 Outstanding scientists“ 07 07 

49 Laboratory technician* e 40. 06 25 

356 Superintendent of hospitals „„ 6 20013 28 04 OF 06 

64 Psychologists ess 05 14 14 10 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested i 
1960s, Decimals omitted. 
* Item considered for inclusion in cluster, but discarded for various reasons. See text. 


2. The distributions were slightly positively In the initial matrix, the highest were .79 be 
skewed. This mild effect could be caused by tween the items Mathematics and Arithmetic, 
any of several factors such as a slight response and .78 between Surgeon and Physician, only 
Set or overrepresentation of some item-content 5% were above .25, and only 1% above 8. 
areas. This strongly suggests that the items are rela- 

3. There were not many high correlations. tively specific and that the respondents pay à 


TABLE 10 


MERCHANDISING CLUSTER 


a clothing storeb 24 22 20 9 22 0 20 84 19 20 28 22 20 12 


Note. Correlati bi ia 55 s ested in 
19608. Decimals Shieh diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample t 


^ New item from the 1966 booklet, added to develop the Set 2 scales, See text. 


b Item considered for inclusion in cluster, but discarded for various reasons. See text 


Item so | o sı [z9] 18 a|s 220 os | 92 | os || 96 [i77 |28 
80 Retailer 59 48 e s 0 48 44 30 
99 Wholesaler 55 58 44 48 33 45 39 47 41 
81 Sales manager 52 48 47 48 34 44 44 50 38 
219 Buying merchandise 
for a store 53 49 50 1 1 22 26 
18 Buyer of merchandise 54 52 51 58 36 33 44 44 32 
42 Hotel manager 36 30 38 38 38 32 232 
85 Manager, Chamber of | 34 36 42 36 36 38 30 47 41 
Commerce 
220 Displaying merchan- 
dise in a store 21 3 3% 33 32 26 
65 Office manager 37 39 42 36 41 38 45 27 35 
92 Stockbroker 29 34 37 30 31 26 30 20 26 
68 Public relations mane 35 35 53 38 41 n 31 
346 Work in an import- 
export business 28 80 36 31 3 33 24 28 20 21 37 
96 Travel bureau man- 
8 5 34 31 32 32 33 43 | 
177 Business methods a 
magazines* 32 36 45 34 / 21 19 2¹ 
218 Looking at things in . 


great deal of attention to the unique aspects of 
each item. 

4. There were no large negative correlations. 
This indicates that, at the item level, there is 
little tendency for people who like one item to 
all dislike another. (This is peculiar, for large 
negative correlations certainly appear at the 
scale level, ranging up to —.88 between the 
Social Science Teacher and Engineer scales.) 
In constructing the basic scales, only a few 
items with negative correlations were included, 
even though a lower (absolute) value of cor- 


After the matrix had been calculated, and it 


Item 
62 Musician 50 
63 Music teacher 56 
66 Orchestra conductor 58 81 
167 Symphony concerts 39 34 
183 Magazines about art and music* 43 41 
352 Music and art events vs. athletic events? 34 27 
152 Playing the piano* 49 37 
157 Jazz concerts* 26 20 
149 Religious music 28 30 
261 Musical geniuses* 43 34 
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TABLE 11 
Mutrrary Acrivmies CLUSTER 


Item |4 |123 | asa |210 | 237 
4 Military officer 49 #42 59 
123 Military drill 52 ó 68 
151 Drilling in a military | 
company 81 61 
210 Drilling soldiers 59 68 74 
$2 44 43 47 


237 Military men* | 


Note.—Correlations above diagonal from Strong's 500 MIG 
tested in 19308; those below diagonal based on sample tested in 
1960s. Decimals omitted. 

a New item from the 1966 booklet, added to develop the Set 2 
scales. See text. 


TABLE 12 
Music CLUSTER 


| 92 | 65 | 6 |107 | a83 | 32 | ase | as7 | 199 | zor 
E — — 


57 40 
32 27 
34 
47 
46 54 
35 48 50 
2 30 30 22 
28 25 30 08 29 
26 34 32 21 21 13 
4 39 46 33 35 23 25 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


* New item from the 1966 booklet, added to develop the Set 2 scales. See text. 


TABLE 13 
NATURE CLUSTER 


telation was accepted. 


Item 37 | 50 | 102 | 106 | 125 | 136 | 147 | 193 | 230 | 326 | 5. 76 | 214 

37 Farmer 40 6 33 35 26 34 40 —34 -14 19 64 
50 Landscape gardener 37 40) 335 „ eee B 
102 Agriculture 58 42 42 44 34 39 42 18 —08 29 44 
106 Botany 21 28 32 51 50 44 39 —09 —10 22 22 
125 Nature study 25 33 35 47 47 64 45 —08 -16 32 25 
136 Zoology 1301:22 20 57 45 41 30 01 —05 21 24 
147 Bird watching 24 31 23 36 48 34 48 17 —20 30 28 

193 Raisi d ; 
ne Ao a vege: Pile os ae ae Ge ee at -32 4 2 
230 Living in a city 24 —08 —17 00 —03 01 —06 —03 2 901 —30 

326 Selling things house to 

hose n e 24 —34 22 —19 —22 12 20 mo 1 i —15 aH 

4 Decorating with flowers „ au be 2 22 7 22 vals 

cher Geese oe dre lo LG, 30) 21 = 

FFFU᷑ III!!! 20") 20 19 24 hf 37 


| 214 Forest rangers 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


19608. Decimals omitted. 
New item from the 1966 booklet, a 


dded to develop the Set 2 scales. See text. 
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TABLE 14 


OFFICE Practices CLUSTER 


Item as | 21 | 25 e | 99 | 105] 135 | 36 |321 | 228 sw 

15 Bank teller 6 3 39 34 30 40 32 
21 Cashier in bank 81 27 
25 City or state employee 28 20 „ % 16) 5 31 30 
65 Office manager 39 40 22 021 
74 Private secretary 3 1 358 32 
80 Retailer 32 30 13 37 19 2 
99 Wholesaler e 639! e 33 23 
105 Bookkeeping 8 I 34 2 28 209 39 
135 Typewriting neee l 15 28 
36 Income tax accountant® e 123) 55, 19 
321 Airline pilot vs. airline 

ticket agent® —18 —20 —15 —13 —08 —13 —04 —15 —07 —14 
208 Making statistical 

charts? ne 09, 27 15 32 —04 
389 Pay attention to details? —07 —03 —11 —10 —08 —06 —02 —14 —12 —13 14 5 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
Item considered for inclusion in cluster, but discarded for various reasons. See text. 


required 3 hr, with a CDC 1604 computer, 
various searching techniques were used, with 
the further aid of the computer when possible, 
to locate smaller matrices containing perhaps 
5-25 items, all having high intercorrelations. 
Using Figure 1 as a guide, high was defined as 
above .30 though it was sometimes necessary 
to drop lower. The absolute minimum was set 
at .20 but even that was violated occasionally, 
especially when the correlations were negative. 


Item Composition of the Basic Scales 


Twenty-two basic scales were finally con- 
structed; item intercorrelation matrices for 
each of them are listed in Tables 2-23. The 
items in the tables were sometimes abbreviated 
for easier presentation here. 

In these matrices, two correlations are prt 
sented for each pair of items. Those above the 
diagonal were generated from Strong’s 500 
MIG, tested during the 1930s, and are the 


TABLE 15 
PUBLIC SPEAKING CLUSTER 
Item 131 | 199 | 72 | 2% 41 | 160 | 75 | 68 | 176 | — 

131 Public speaking 66 34 

199 Making a speech 67 39 A A 

72 Politician 42 43 22 51 

26 Minister, priest, or rabbi 25 29 21 23 

41 Governor of a state 40 47 62 27 
160 Electioneering for offices 38 43 65 18 50 

75 Radio announcer? 34 32 32 24 30 29 

68 Public relations mana 38 28 39 21 34 35 39 
176 Telling jokes? 26 22 19 06 14 21 24 20 
367 Prefer working alone to working 

on committees? melee 18408 E s -13 05 


Note. Correlations above diagonal from Strong's 500 MIG tested in 19308; those below diagonal based on sample tested 1! 


1960s. Decimals omitted. 


New item from the 1966 booklet, added to develop the Set 2 scales, See text, 


> Item considered for inclusion ir cluster, but discar 


ded for various reasons. See text, 
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TABLE 16 
RECREATIONAL LEADERSHIP CLUSTER 


Item 


7 e ie . 12 | 380 | 230 | 60 | 382 | 174 


7 Athletic director 


S8 44 
70 Playground director 55 40 
79 Reporter, sports page 51 43 
127 Physical education 62 42 40 
143 Boxing 2 
171 Sports pages in newspapers 3 V 148 
12 Auto racer 13 09 14 
350 Playing baseball 29 20 17 
280 Athletic mene 44 27 338 
60 Professional baseball playerb 89 38 57 
352 Music and art events vs. athletic 
eventsb —41 —16 —30 
174 Skiing? 18 f 


“u 32 41 26 25 34 

w BY a BR Do 

2% 31 49 260 24 23 

26 27 16 26 31 

32 3 2 20 19 

38 20 20 14 31 

10 24 04 20 10 

31 19 08 12 22 

48 2 39 08 20 

ae 7 186 3883 39 
—31 —18 —49 —10 —07 —33 —37 
17 11 a 2 30 20 OF 03 


Note. Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 
* Inadvertently left out of Set 1 scales. 


b New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
e Item considered for inclusion in cluster, but discarded for various reasons. See text. 


correlations used here to cluster the items into 
scales. The correlations below the diagonal are 
based on a recently tested sample of MIG who 
filled in the 1966 booklet. (Many of them were 
actually tested in 1964 and 1965, using a pre- 
publication version of this booklet.) This 
second group, which also included 500 men 
from a variety of occupations, provided a 
replication to recheck the homogeneity of the 
selected clusters. In general, the intercorrela- 
tions remained high in the second group, indi- 
cating that these clusters are stable over time 
and from one group to another. 

The latter group also provided item inter- 
correlations for the new SVIB items added in 
the 1966 revision, and several items, those so 


noted, were added to the scales after reviewing 
these figures. The expansion of the scales with 
these new items constitutes Set 2 of the SVIB 
basic scales. 


Considerations in Scale Construction 


It is difficult to recapture exactly the 
methods used in searching out these sub- 
matrices; a combination of computer searching, 
art, hunches, and intuition was employed to 
decide which items should be grouped together, 
and it would be difficult to exactly replicate 
these techniques. To demonstrate some of these 
decisions, items at the bottom of some matrices 
are noted; these were considered for inclusion 
in the scale but were rejected for one reason or 


TABLE 17 
RELIGIOUS ACTIVITIES CLUSTER 


Item 


26 Minister, priest, or rabbi 
255 Worker in YMCA 
Religious people 

124 Bible Ae 

149 Religious music* 

161 Going to church* 

169 Church young people's group" 
181 Reading the Bible* 


59 


56 51 


Note. —Correlations above diagonal from Strong's 500 MIG tested in 1 
ded to develop the Set 2 scales. See text. 


1960s, Decimals omitted. 
New item from the 1966 booklet, ad 


930s; those below diagonal based on sample tested in 


12 CAMPBELL, BORGEN, Eastes, JOHANNSON, AND PETERSON 
TABLE 18 
Saus CLUSTER 
= ———— — — 7 l= 
Item 11 so | 95 |197 |326 | 335 | os [am 

11 Auto salesman S 232 2 9 35 

S4 Life insurance salesman 43 © 3 3% 41 42 41 

77 Real estate salesman Shit 32 3 41 580 44 44 

80 Retailer 34 35 41 48 39 38 30 

81 Sales manager 3 41 B 52 47 48 54 

90 Specialty salesman 3 50 51 45 57 54 43 

95 Traveling salesman 4 44 4 #38 52 56 51 

197 Interview prospects in selling 4 4 47 36 66 59 (52 

326 Selling things house to house* 3 33 2 18 33 38 

335 Definite salary vs. commission on 

what is done* —16 —15 —20 —07 —32 —28 
68 Public relations man- 31 35 40 35 53 41 
360 Appraise real estate” rer 21 18 28 20 —19 19 


i N above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested & 
* New item from the 1966 booklet, added to de the Set 2 scales. See text. 
> Item considered for inclusion in cluster, but disc: for various reasons. See text. 


another, usually because of low intercorrela- 
tions but occasionally because of content. 

To guide these decisions, the following two 
standards were closely followed: first, each de- 
cision was based on statistical evidence so that, 
for example, no two items were clustered to- 
gether unless it was empirically defensible; 
second, the other major concern was with 


eventual interpretation. Thus, considerable 
attention was paid to item content. Occa- 
sionally, an item with relatively high correla- 
tions with a cluster of items was not included 
with them because the item simply seemed out 
of character. One such example was the item 


34 42 

2 38 32 

38 49 21 17 

Sculptor in the Science cluster (Table 19). 


TABLE 19 
SCIENCE CLUSTER 


worker 50 
212 Doing research 
23 Chemist AE IO E es 
3. 9 
E S440 s eg 
39 37 44 
5 Chemistry ne a ar + 25 25 
Astronomer 26 
10 Author of technical 3 eh 
book 27 26 30 
128 Sa ee 21 36 2 32 50 34 33 
107 us 3 22 34 22 10 30 30 
136 Zoology 33 28 31 25 28 32 30 
5 Geology Se 28 26 28 24 27 34 26 
tronicequipments | SO 41 55 46 3 
22 8 tech- 7 
ye =| 2 PRS 2 . 
ist- 32 47 
346 Business vs. re- Ep 
search laboratorya | —50 —47 —35 —33 —27 =26 -31 
264 Outstanding sci- 
entists* 31 27 30 25 286 24 32 
87 Computer opera- 
ato} 29 23 21 34 18 12 16 
64 Psychologist® 22 15 12 14 08 20 20 


Note. —Correlations above 
1960s. Decimals omitted. 


34 [sso %% | 


Although that item correlated fairly high with 


4 34 22 27 0 
33 36 38 30 
21 36 31 20 
50 38 36 30 
28 30 31 34 
29 32 23 20 
44 30 32 
55 19 15 
27 21 41 
30 20 43 
41 31 20 22 
35 25 19 21 68 
34 24 40 51 36 37 k 
—28 ~23 —20 —12 —30 —31 —21 
28 17 28 24 23 19 18 —20 
18 17 06 09 30 36 17 -16 14 
12 08 19 12 10 06 17 —00 10 08 


diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


New item from the 1966 booklet, added to develop the Set 2 scales, 
b Item considered for inclusion in cluster, but discarded for cia eee text, 


Social problem movies 

28 Contributing to charities 
Raising money for a charity 

$59 Dog trainer vs. parole officer 


Decimals omitted. 
for inclusion in cluster, but discanted for 


(.37), it was not included with them because 
it just did not seem to belong. 

Another example is shown in Table 7 which 
has the item intercorrelations for the Mathe- 
ics scale. At the bottom of the table are 
‘several items which might have been included 
with this scale if the size of the intercorrela- 
tions had been the only consideration. How- 
‘ever, they were not added because they would 
have clearly diluted the homogeneity of the 
Mathematics cluster. Such decisions were not 
‘made often, nor lightly, but all of the decisions 
were definitely not left in the hands of the 
computer most of them, though, were. 
Scanning the item intercorrelation clusters 
‘makes it clear that some item-content areas 


82 School teacher 68 
27 College professor 63 
63 Music teacher 34 20 
203 Teaching adults s u 
202 Teaching children 38 22 
S Librarian = 4 
High school principal 
387 Have patience when teaching others* 1 a 


359 Dog trainer vs. parole officer 


» Decimals omitted. 


Bastc IK Scares ror rue SVIB 
TABLE 2 


item from the 1966 booklet, added to develop the Set 2 scales. See 
considered inch Variogs meras. 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; 
develop the Set 2 scales. See text. 
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n bad 
-H -1S -H 
4s 3% 0 10 —04 
18 21 R „ -10 OO 


—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below digonal basni os sampir tested ia 


are better represented in the SVIB item pool 
than others. In the mechanical area, for 
example, many items were available, while in 
the Religious Activities area only three items 
were available for the Set 1 scales. Normally 
three items would not be considered sufficient 
to constitute a scale but because it was pos- 
sible to generate a longer scale for the Set 2 
scales such a scale was constructed. 

While most of the possibilities for such 
scales in the SVIB item pool have been ex- 
hausted, no claims are made to have exhausted 
the domain of interests. When working with 
item intercorrelations, it becomes very ap- 
parent that a cluster of interrelated items will 
emerge if enough items are available concerning 
any one type of activity. But if an activity is 
not represented in the SVIB item pool, then 


TABLE 21 


48 38 
37 H 29 38 
18 21 39 
23 45 18 
24 53 25 
35 18 17 
29 32 28 31 
13 34 26 06 18 
-02 —20 —11 -0 —24 —17 


those below diagonal based on sample tested in 


608. De fi 
New item from the 1966 booklet, added but discarded for various reasons. See text. 


item considered for inclusion in cluster, bu 
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TABLE 22 
TECHNICAL SUPERVISION CLUSTER 


Item 35 k 58 | 61 ss | 347 

35 Factory manager | —10 

58 Manufacturer 54 37 g —11 
61 Mining superin- | 

tendent 40 38 50 10 

88 Shop foreman 45 35 44 09 
347 Technical vs. sup- | 
ervisory responsi- | 

bility —22 -17 01 —02 


Note.—Correlations above diagonal from Strong's 500 MIG 
tested in 1930s; those below diagonal based on sample tested in 
1960s. Decimals omitted. 


no scale to measure interests in that area can 
be constructed. This is probably not a signifi- 
cant problem because the SVIB item pool does 
have considerable diversity, especially in occu- 
pational activities. However, certain omissions 
can be quickly identified by scanning analogous 
intercorrelation matrices which have been de- 
veloped for the women’s SVIB. For example, 
one scale available for the women’s form but 
not the men’s has been labeled “Homemaking” 
and contains the following items: 
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Doing your own laundry work 
Cooking 

Sewing 

Preparing dinner for guests 
Trying new cooking recipes 
Home Economics 


Whether or not such a scale would be useful 
on the men’s profile can only be an academic 
question here; because these items have never 
been included in the men’s booklet, this scale 
is not available for use in comparing male occu- 
pational groups. 

In constructing these scales, some difficult 
decisions arose over the issue of scale inter- 
relationships, the most difficult being that of 
scale independence. Most psychometricians, 
and probably all factor analysts, would likely 
argue that these scales should be as independ- 
ent as possible; certainly they would prefer 
that there be no overlapping items. While 
there might be some statistical advantages in 
such procedures, it is not at all clear that the 
resulting scales would be the most accurate re- 
flection of the structure of interests, as some 
interest dimensions are related, and some SVIB 
items tap more than one dimension, 


ITEM One such quandary appeared in the three 
Caterer scales: Sales, Merchandising, and Office Prac 
Cook tices. The correlations between these scales are 
Housekeeper , high: .78, .63, and .79, respectively, and there 
Home Economics Teacher is probably only one underlying dimension— 
TABLE 23 
WRITING CLUSTER 
Item 9 | 57 | 7 | 53 78 11 40 u | 172 E 10 
9 Author of novel 1 
57 Magazine writer 65 8 232 
71 Poet 52 80 0 22 51 
31 Editor 2 7 . 43 20 
53 Librarian 28 36 37 34 32 
78 Reporter, general AD, 8 26 24 
119 Literature 51 42 48 43 30 40 26 54 40 
40 Foreign correspondent . 36 29 18 
112 English composition 48 47 40 44 35 40 57 32 30 
172 Poetry „ „ «42, UEA: * 98 41 
165 Writing a one-act plays 58 8 55 50 / 45 43 45 46 44 
170 Biographies ,,, P 40 
75 Radio announcer 23 34 22 3 15 i e e e 


Note. —Correlations above 
1960s. Decimals omitted. 


diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 


b Item considered for inclusion in cluster, but di 


iscarded for various reasons. See text. 
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business. Yet, there were three fairly distinct 
clusters of items and, as they clearly merit dif- 
fering interpretations, they were retained as 
separate scales. Whenever such an arbitrary 
decision had to be made, the overwhelming 
consideration was to improve the final inter- 
pretability of the scales, not to satisfy inter- 
mediate psychometric requirements. The even- 
tual impact of these decisions can be seen in 
Table 31 where the scale intercorrelations are 
reported; some of the correlations were quite 
high—in the .70s—and that is undesirable. 
Still, one cannot have it both ways; either you 
take the clusters as they stand, or you arti- 
ficially eliminate overlapping items. The 
authors’ preference was for the former. 


“Personality” Scales 


The scales which appeared first in the scale- 
construction stage were those that researchers 
in interest measurement have learned to 
expect: Sales, Science, Mechanical, Social 
Service, and so forth. But the authors were 
also interested in constructing scales that 
would resemble the more traditional per- 
Sonality dimensions such as aggressiveness or 
compulsivity. 

Several attempts to build such scales from 
the SVIB item pool were singularly fruitless. 
Neither computer algorithms nor intuition 
located clusters of items that held together 
Statistically. To illustrate the difficulties, inter- 
correlations for items selected to represent the 
dimensions of autonomy and compulsivity are 
shown in Tables 24 and 25. The level of homo- 
geneity of these “clusters” falls far short of 
the minimal standards of statistical coherence 
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which were followed in forming the occupa- 
tional oriented basic scales, and these per- 
sonality clusters were dropped. These failures 
might be attributed either to a lower cohe- 
siveness of personality clusters, or to the in- 
adequacies of the SVIB item pool in the per- 
sonality domain. 

Finally, to gain some experience with such a 
personality scale, the Adventure scale was 
forced into existence. Items with a derring-do 
flavor were separated out, their intercorrela- 
tions were scanned, and those with at least 
mild positive correlations, generally larger than 
10, were clustered together. As will be seen 
later, the resulting scale worked reasonably 
well—a group of astronauts scored highest, 
2.50 SDs above the lowest scoring occupations 
of school superintendents and mathematicians 
—but, still, the scale seems to have less occu- 
pational relevance than the other more homo- 
geneous scales. For example, on the other 
scales, scores increase with age, especially with 
occupational experience, but scores on the Ad- 
venture scale decrease with age. In general, 
attempts to build homogeneous personality 
scales were unsuccessful. 


Scale Reliability 


Relatively few items are available in the 
SVIB for any single content area; thus, most 
of these scales are short and one of the first 
concerns here was with scale stability. Building 
short scales, no matter how pure, could con- 
ceivably result in scales with unacceptable re- 
liabilities. To check this, several scales of 
varying lengths were constructed and their 
long-term reliabilities were compared. Six 


TABLE 24 


Autonomy CLUSTER 
JJ reel eee a a 


Item 299 | 335 | 336 | 351 | 338 | 367 392 
299 Freedom in . . . doing work —16 13 05 = 
335 Commission on what is done vs. salary | —14 —25 E ie 
336 Work for yourself 18 =29 1 — 95 
351 Amusement alone . . .* 12 —00 n i 
338 Work for self in small business“ —10 25 805 ea, 
367 Prefer working alone 06 ot 5 AAAY Hs 
392 Dislike taking orders 0g —06 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; t 


1960s, Decimals omitted. 
Item with negative weight. 


hose below diagonal based on sample tested in 
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TABLE 25 
CoupvrsivIry CLUSTER 
— — — — r = E 
Item 105 zas m | ss | sat | sss | sso | se | sos | 373 | sas Tam 

105 Bookkeeping 20 26 -0i -0 -02 17 

223 Methodical work } 2 14 —10 —21 —02 30 

227 Saving money 22 14 — 24 -03 —08 15 

3M Playing safe* — 10 —14 —10 03 14 —06 

M41 Work with many detail -0 -27 -03 O -08 —19 

MA Similarity in work* 10 -10 -08 19 OO 10 

380 Plan work in detail | 12 28 18 -0% —18 —01 

362 Make decisions immediately 0o 11 -0 2 09 7 —10 

366 Keep detailed records of expenses 19 21 21 —18 -11 -13 20 11 

373 Am always on time with my work} 06 14 11 —11 —01 -0 25 O7 13 

385 Am slow going and sure —01 10 08 —21 O1 -08 12 -2 06 -02 

389 Pay attention to details very little] — 14 20 —12 14 29 O4 —37 17 —32 13 —10 


; omitted. 
* Item with negative weicht. 


content areas were selected and two test-retest 
samples were used. The first samples included 
102 members of an Army Reserve unit tested 
twice over a 30-day interval; the second in- 
cluded 191 Stanford University seniors tested 


TABLE 26 


RELIABILITIES ror Some EXPERIMENTAL 
Basic Scares 


Content area 


Science 


Mechanical 


Sales 


Writing 


Public speaking 


Mathematics 


Note,—Median test-retest correlation for 53 lar S' 
scales: for 30 days, 91; for 22 yr., 67. N 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested le 


first in 1927 and retested in 1949. Test-retest 
correlations for both of these samples for 
several experimental scales of differing lengths 
are reported in Table 26. Within each content 
area, the scales were formed from the “best” 
items available; thus, the 8-item Science scale 
has those items with the highest intercorrela- 
tions, the 16-item scale contains the initial 
8, and the next 8, and so forth. 

Several conclusions can be drawn from these 
reliability calculations: 


1. The relationship between scale length and 
reliability was approximately the same, 
whether split-half or test-retest reliability co 
efficients were used. As the major concern wi 
the SVIB is long-term stability, test-retest 
reliability is more important, and that sta. 
tistic was used for the other comparisons. 

2. In general, the expected relationship ap- 
peared between scale length and reliability 
the longer scales were more reliable but the 
relationship was by no means perfect. The 
shortest Mechanical scale (14 items) was more 
reliable over 22 yr. than the longest (25 items) 
The most reliable scale over the 22-yr. period 
was the 5-item Mathematics scale. 

3. There was considerable difference between 
the content areas in the reliability of scales 
equal length. Thus, the 8-item Sales scale, T- 
item Writing scale, 7-item Public Speaking 
scale, and S- item Mathematics scale had 229, 
test-retest reliabilities ranging over a Co 
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siderable span: 47, 59, .67, and 78, respec 
tively. This range could be caused either by 
differing degrees of homogeneity within the 
sales, the nature of the interest clusters in- 
volved, the composition of the test-retest 
samples, or all three factors. 

4. In general, these reliabilities were roughly 
equivalent to those of the regular SVIB occu- 
pational scales. The loss due to scale shortness 
was apparently offset by the gain attributable 
to scale homogencity, (The regular occupa- 
tional scales have roughly 60-75 items.) 


The general conclusion reached from this 
series of studies was that short scales, if their 
content is homogeneous, can be as reliable as 
long scales. Thus, the conflicting goals of purity 
of content, measured by item intercorrela- 
tions, and longest possible scale length were 
Stressed in the scale construction. 

Further information on the stability of these 
Scales is reported in a later section. 


Ilem Weights 


Once the items had been selected for each 
scale, scoring weights were established by 
assigning the weight of +1 to the Like re- 
Sponse, and —1 to the Dislike response. In the 
few cases where the item intercorrelations were 
negative, these weights were reversed. 


Cranny’s Work in Factor Scales 


In a closely parallel but independent project, 
Cranny (1967) has factor analyzed the items 
of the SVIB; he described his project as 
follows: 


The responses of 500 men in general to each of the 288 
items were intercorrelated and the items arranged in 15 
clusters on the basis of their intercorrelations. Thur- 
stone’s (1947) multiple group method of factor analysis 
was followed. One factor was extracted from each cluster 
and the angular cosines among the 15 oblique factors ob- 
tained. Four factors were extracted from the 15 by 15 
matrix of angular cosines using Lawley’s (1940) maxi- 
™mum-likelihood procedure. The residuals were reduced 
to less than .00005. 

The four factors were rotated obliquely and Wherry’s 
(1959) hierarchical factor solution was applied to yield 
four sub-general and 15 group factors. The factor load- 
ings on the 19 factors of each of the 288 items were com- 
puted and a 288 by 288 residual table was obtained. 

The items with loadings on each factor are listed and 
descriptive labels are suggested for each factor. One of 
the group factors was not sufficiently defined to permit 
interpretation [p. 1]. 


TABLE 27 
Leni, oF Ceasory’s veses SVIB Seutas 


Pubic Contact t Sprakiog 

Writer * Writing 
. 

Fa Agriculture 


Sa apes, 


There is considerable agreement between 
Cranny's factor scales and the scales derived 
in the current project, which is interesting 
because the scale-construction techniques dif- 
fered somewhat. Cranny’s was essentially a 
statistical approach; in contrast, the methods 
of this study were much more intuitive. 

A comparison of the two sets of scales ap- 
pears in Table 27 where the scales are matched 
up by name. This comparison exaggerates the 
similarity slightly as some scales with the same 
names have slightly differing item content. 
Still, the resemblance is considerable and 


reassuring. 
Many of the differences were due to greater 
concern in this present study for eventual 
interpretability. The treatment of the business 
area is one example; Cranny has two scales: 
Business Management and Sales, while this set 
contains five, covering approximately the same 
item content: Business Management, Sales, 
Technical Supervision, Merchandising, and 
Office Practices. There is no simple way to 
determine which approach is better. Cranny's 
scales are probably statistically more de- 
fensible; the present authors’ scales are 
probably easier to interpret—so the choice 
between them is largely a matter of taste. 
Cranny's report, which is an impressive 
amount of work for a doctoral dissertation, 
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contains no normative or other psychometric 
data on his scales. 


Evaluation of the Basic Scales 


In developing these scales, both statistical 
precision and common sense were used. The 
success of these techniques must be deter- 
mined, not by studying the scale-building 
techniques themselves but by the efficacy of 
the resulting scales. To this end, a great deal 
of information follows on the reliability, va- 
lidity, and other characteristics of these Basic 
Scales. The results seem meaningful and, after 
studying these data, users of the SVIB should 
have more confidence in their interpretations, 
and researchers should be able to home in more 
precisely on the unanswered questions. 

One result of this work is that now some- 
thing more is known about the organization of 
interests. The clusters of items were not 
forced; for the most part they fell out of the 
intercorrelation matrix relatively cleanly and 
their makeup reflects something of the basic 
dimensions underlying the items. In this sense, 
some clusters that did not appear are note- 
worthy. The failure of personality clusters to 
appear has already been noted. Another inter- 
esting omission was anything resembling a 
“prestige” cluster—it might have contained 
items such as corporation president, governor, 
and perhaps college professor. This might indi- 
cate that status per se is not an important 
determinant of item choice. 

The clusters that were formed merit more 
extensive study as they have some psycho- 
logical integrity, so to speak. More is said about 
this in the section on construct validity. One 
observation can already be made; that is, item 
clustering tends to follow vertical, not hori- 
zontal, divisions in the occupational world. 
Thus, the Science scale includes high and low 
laboratory positions; the Office Practices scale 
includes high and low office positions. This 
strongly suggests that the unifying feature is 
the activity involved, not the more tangential 
aspects of the work. 


NORMS FOR THE Basic SCALES 


Once the basic scales were developed, norms 
had to be developed for interpretation; simply 
calculating the number of Likes that an indi- 
vidual marks in each scale would be of little 
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value because the areas differ in popularity, 
To make the scores meaningful, they must be 
standardized in some manner that will permit 
ready comparisons between individuals and 
between scales. One common way to do this— 
and the method that has been employed here— 
is to convert the raw scores into standard 
scores with a known mean and standard devia- 
tion, usually 50 and 10, respectively. To do 
this, it is necessary to score some reference 
sample on each scale, then use the resulting raw 
score mean and standard deviation in a raw- 
score-to-standard-score conversion formula. 
The question as to which reference sample 
should be used is difficult to answer. The regu- 
lar occupational scales are normed against 
whatever occupation they are based on but 
that is not possible here as these scales are not 
specific to any one occupation. For these basic 
scales, there are various arguments for using 
amore general sample such as college freshmen, 
or high school seniors, or college-bound high 
school seniors, or perhaps a random sample of 
young adults, or even a stratified sample of the 
entire male population. Each would have its 
peculiar advantages and disadvantages. 


Description of Norm Sample 


The selection of a norming sample was again 
a result of concern with eventual interpreta 
bility. The men in this sample have been 
tested twice, once as teenagers and once a8 
adults, They were first tested by Strong in 1930 
when they were 16 yr. old, in his cross-sectional 
study of the relationship between age an 
interests (Strong, 1931). They have been le. 
tested in 1966-67 in a longitudinal study of the 
relationship between teenage interests an 


TABLE 28 
EDUCATIONAL LEVEL or Norm GROUP 

Educational level n % 
Less than high school graduate 94 u 
High school graduate 139 2 
Some college 170 26 
BA, BS 149 25 
MA, MS 50 : 
MD, DDS, LLB 26 5 

PhD, EdD 19 
Total 647 100 
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adult occupations. (When retested, their mean 
age was 51.8, with a standard deviation of 
1.50. To describe them as “16-yr.-olds” when 
first tested and “‘52-yr.-olds” when retested 
provides brief and essentially correct labels.) 
For this reason, they make an especially good 
norm group as norms can be established, using 
the same people, for both teenagers and adults. 
This advantage of being able to compare 
directly 16-yr.-olds with 52-yr.-olds, using the 
same sample, is offset slightly by the dis- 
advantage that the 16-yr.-olds were drawn 
from the 1930 population, not the current gen- 
eration, but this is a small price to pay for 
having directly comparable groups. 

A more precise description of the sample is 
not possible, for Strong never wrote down his 
specific 1930 sampling methods. Nevertheless, 
it is apparent that he did try to obtain a wide 
and diverse sample of teenagers, and it is not 
grossly inaccurate to consider this a cross- 


TABLE 29 
Jon Satisraction or Nonm Grour 


i 
% choos- 


Item ing item 


a. It is exactly what I have wanted to do. 18 
b. It is approximately what I have wanted 
to do. 28 
c. It is something I entered, due to circum- 
stances more or less beyond my control, 
but I am now satisfied in it. 40 
d. It is a career which is tolerable but not 
really what I would like to do. 2 
It is an unsatisfactory and unrewarding 
career, 
It is a career that I strongly dislike and 
I wish I could leave for some other. 


p 


— 


0.3 


Note.— The instructions were as follows: Please check one 
mo following which best describes how you feel about your 


section of males within the specified age ranges, 
with perhaps a mild underrepresentation of the 


TABLE 30 


TEENAGE AND ApuLT MEAN SCORES For Bastc SCALE NORM GROUP AND CORRELATIONS 
BETWEEN SET 1 AND SET 2 SCALES 


Adult scores (age 52)* Teenage scores (age 16)>) 
Scale Set 1 Set 2 r 
M SD SD 

Public Speaki —0.93 2.67 —1.41 3.94 04 
Law/ Pa —0.82 3.39 —141 4.08 97 
Business Management 0.43 5.03 1.03 6.01 98 
Sales —2.58 4.15 —3.73 5.11 96 
Merchandising —0.87 4.89 —0.58 6.54 98 
Office Practices —1.63 3.63 —2.72 4.27 98 
Military Activities —1.12 2.02 —1.02 3.04 150 

Technical Responsibility 0.56 2.08 0.56 2.08 4 
Mathematics 2.39 3.35 2.39 3.35 1.00 
Science 1.90 5.23 2.78 7.31 98 
Mechanical 4.54 10.97 5.40 12.07 1.00 
ature 2.67 5.06 3.08 5.44 99 
Agriculture 1.12 3.23 1.12 3.23 1.00 
Adventure —0.80 2.48 —0.23 3.53 90 
Recreational Leadership 0.80 3.66 1.92 4.86 He 
Medical Service —0.02 3.28 —0.02 3.28 x 
Social Service —0.87 2.71 —0.90 3.35 S 
Religious Activities —0.76 501 211 305 al 

Teachi —0.29 . 5 "i „ 
hae —0.28 27 —0.63 4.68 A 
Art —0.54 4.16 —1.28 6.63 = 

Writing —0.66 5.09 —0.72 6.02 i 


* Raw scores. (The adult norm group has a st 
> Standard scores. 


andard score M of 50, SD of 10 on all scales.) 


TABLE 31—Basic SCALE INTERCORRELATIONS 
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S288 2888288228888 lower end of the educational and socioeconomic 
nnn ladder. | 

Some basic demographic data are available 
for the total group. Their educational level is 
listed in Table 28. Practically all (86%) are 
high school graduates; about two-thirds have 
had some college; over a third have degrees. 
One in eight have earned graduate degrees. 
Ninety-three percent were married in 1966. 
They had, on the average, 18-yr. experience 
in their jobs. Their feeling toward their work 
is reported in Table 29 which lists the percent- 
age checking various choices in answer to a 
question on job satisfaction. 


Raw-Score-to-Standard-Score Conversion 


The mean scores for this norm group on both 
sets of scales are listed in Table 30. For the 
SS SSS SNN 888888888 adults, only the raw-score statistics are listed; 
these are the figures that are used to convert 
an individual’s raw scores into standard scores 
by using the following formula: 
ea 


sp 2° + 50 = individual’s standard score 


= 301). 


= 647), 


where X = individual’s raw score, M = adult 
norm group raw score mean, SD = adult norm 
group raw score standard deviation. This con- 
version, which is a simple linear transformation 
and does not change the shape of the distribu- 
tion, has been used on all the remaining scores 
reported in this report, including the teenager 
scores of the norm group. The net result is to 
convert all scores into a distribution where 
this adult norm group has a mean of 50 and 
standard deviation of 10. This permits an 1m- 
mediate comparison between the individuals 
score and the mean score of the norm group, 
and makes other comparisons possible also— 
such as comparing a person’s score on the Law 
Politics scale with his score on the Science 
scale. 

The adult group was used here because both 
Set 1 and Set 2 scores were available for them; 
because the teenagers were tested in 1930 Wi 
the old booklet, only Set 1 scores are available 
for them, and the means are reported in 
standard score points. As can be seen, thet 
means on almost all of the scales were slight 
below the adult mean of 50. The only exc 
tions were the Adventure and Recreation 
Leadership scales. 


sponsibility 


Set 2. Correlations below diagonal calculated on cross-section sample of 52-yr-old men (N 


Scale 


25 
10 
35 
22 
16 
08 
04 
14 
Note.—Set 1. Correlations above diagonal calculated on sample of Stanford University seniors, 1927 (N 


3 Business Management 


4 Sales 
7 Military Activities 


1 Public Speaking 
8 Technical Res, 


2 Law/Politics 
9 Mathematics 


10 Science 


5 Merchandising 
6 Office Practices 


15 Recreational Leadership 


16 Medical Service 
17 Social Service 
18 Religious Activities 


19 Teaching 


20 Music 


13 Agriculture 
21 Art 


11 Mechanical 
12 Nature 
14 Adventure 
22 Writing 


21 
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The correlations between the Set 1 and 2 
scales with the same name are reported in 
Table 30 also, and they indicate the scales are 
essentially identical, with the mild exception 
of Religious Activities and Music, but even 
their correlations (.89 and .91) are high enough 
so that thescales could be used interchangeably. 


SCALE INTERCORRELATIONS 


The intercorrelations between the scales are 
presented in Table 31. Those above the diago- 
nal are based on the Set 1 scales, using the 
scores of a sample of 1927 Stanford University 
graduates. The correlations below the diagonal 
are based on the Set 2 scales, using the scores 
of the adult norm sample. 

Using these intercorrelations as a guide, the 
scales have been ordered so that the adjoining 
ones tend to be related. 


Suggested Profile 


The profile shown in Figure 2 has been pre- 
pared as an aid in interpreting these scores. 
The order of the scales was taken from the 
intercorrelation table. The heavy vertical line 
at 50 represents the mean score of the adult 
norm group; the lighter line, the teenage mean. 
The highest and lowest possible scores for each 
scale for both Set 1 and Set 2 are also marked. 


SCALE RELIABILITIES 


Scores are available for several groups that 
have been tested and retested with the SVIB 
over varying lengths of time. Test-retest cor- 
relations and means and standard deviations 
for both test and retest administrations are re- 
ported in Table 32. All of these comparisons 
use the Set 1 scales; as the Set 2 scales are 
usually longer, their reliabilities might be 
slightly higher. 

The 2-wk. test-retest sample contains mostly 
sophomores at the University of Minnesota, all 
volunteers from the University’s introductory 
psychology course. 

The 30-day sample includes the officers and 
men from an Army Reserve Unit at Fort 
Snelling, Minnesota. Almost all of them had 
college degrees. Their occupations were widely 
scattered, with a mild concentration in the ad- 
vertising and mass media fields. 

The Harvard students, who were tested in 
the spring of their freshmen year and retested 


in the fall of their senior year, were participants 
in an extensive longitudinal study being cos- 
ducted by Stanley King and Bruce Finnie, 

The YMCA Secretaries were a group first 
tested by Strong in 1927-30 when they were 
40 yr. old, and retested by Verburg in 1951 
when they were 63 and, for the most part, re- 
tired (Verburg, 1952). 

Further information on these retest groups, 
except the YMCA Secretaries, is available in 
the current SVIB manual (Campbell, 1966). 

The median test-retest correlation of these 
basic scales for each of these groups is given at 
the bottom of Table 32, and the analogous 
figure is reported for the regular SVIB occupa- 
tional scales. As can be seen, the basic scales 
are slightly but consistently less stable over 
time than the occupational scales; the longer 
length of the latter is undoubtedly the reason. 
The difference between the reliabilities of the 
two types of scales is probably trivial; the 
length of the test-retest interval is still far more 
important in determining level of reliability 
than is type of scale. 

Test-retest reliabilities for the basic scales 
are available from another set of men who were 
studied several times by E. K. Strong, Jr. He 
tested them originally as freshmen at Stanford 
in 1930, again the following year as sopho- 
mores, once more in 1939, 5 yr. after they had 
graduated, and, finally, in 1949, 15 yr. after 
they had graduated from Stanford (Strong, 
1955). Table 33 has the mean profiles for the 
group at each testing and the test-retest cor- 
relations over the various intervals. The 
number tested varied from one time to another 
and is reported at the bottom of the table for 
each comparison. From these figures one can 
draw the following conclusions: 


1. On most of the scales, the mean scores of 
this diverse group of students resembled the 
norm group’s mean of 50. As freshmen, their 
most significant deviations were their low 
scores on the Nature and Teaching scales, 4 
and 45, respectively, and their high score of 50 
on the Adventure scale. > 

2. Nineteen years later, when retested, their 
interests had shifted mainly in these same three 
areas and, on the Nature and Adventure scales, 
they closely resembled the norm group. On the 
Teaching scale, however, the shift had cart! 
them beyond the norm group to a score of 53. 


TABLE 32 
Basic Scare Means, STANDARD DEVIATIONS, AND TEST-RETEST CORRELATIONS FOR 


SEVERAL SAMPLES OVER VARYING Time PERIODS 


x . 


l 


3 e 
= 


5 — 
. — 
2 


22222525222277225757 


J yr. sample 


Harvard students 


23222 — — 


88328820 


Army Reserve group 
30-day sample 
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28 
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Note. —2-wk. sample N = 140; 30-day sample N = 102; 3-yr. sample N = 189; 22-yr. ample N = 47) 30yr. sample N = 08, 
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TABLE 33 


Basis SCALE MEANS, STANDARD DEVIATIONS, AND TEST-RETEST CORRELATIONS FOR STANFORD 
STUDENTS TESTED AS FRESHMEN, SOPHOMORES, 5 YEARS AFTER GRADUATION, 
AND 15 YEARS AFTER GRADUATION 


1949 
| 
S1 82 |523| 95| 5 .59 66 
4 56 | S36 9.1 51 6 66 
S4 31 | 52.4 | 10.5 | 48 55 78 
48 | 49 | 4783] 9.3] 46 | 30 81 | 
52 $7 | S17] 10.8] 1 56 1 
A8 57 51.3 | 10.2 40 4 72 | 
63 67 | 49.5 | 10.0 56 62 4 | 
81 S1 5141.1 51 50 69 
15 71 {327 | 9.2] .73 70 87 
OF 72 52.4 | 10.0 05 22 .16 
72 .77 | 48.6 | 11.2 72 20 85 
62 68 | 47.2 | 10.2 | 52 53 72 
61 67 | 475 9.8 -60 60 70 
54 SS | 52.0 | 11.1 59 5S 68 
57 | 68 45.9 9.5] 63 67 70 
66, 66 | 49.8 | 111 56 61 70 
34 | 44 | 48.2] 9.7] 38 | 49 77 
3⁰ 37 44.7 9.7 31 31 56 
AS AS | 50.7 8.9 40 53 62 
58 51 513 8.1 53 51 63 | 
59 60 | SIS | 9.8] 56 61 79 
67 | .73 | 53.9 | 10.2 | .64 ‘72 81 | 
56 37 54 56 -76 


Public Speaking $2.2 94 .79 52.8 9.8 
Law/ Politics $2.2 | 97] .79 | S38 | 10.1 
Business Management 49.2 | 103 20 511 | 10.5 
Sales 47.9 8.7 20 48.9 93 
Merchandising s03 | 102 72 | SUS | 10.7 
Office Practices $1.2 | 10,7 69 $2.3 | 11.2 
Military Activities $1.9) 99 | 78 |504| 98 
Technical Supervision 475) 118 68 49.1 | 10.6 
Mathematics Sil 99 MA | S12 99 
Science 49.2 | 104 82 | 49.1 | 10.0 
Mechanical 47.1 | 105 | 8S | 46.7 | 103 
Nature 43.3 | 104 Bl |434 | 106 
Agriculture 47.0 98 77 46.9 9.8 
Adventure $6.1 | 11.2 | 75 |573 |4 
Recreat Leadership | 51.3 | 8.8 | .77 |519 | 9.1 
Medical Service 48.7 | 104 $0 | 48.3 99 
49.2 9.7 70 |486 9.7 
Rel: Activities 45.5 | 93 | 68 | 44.3 9.5 
Teac! 448 | 96 | 69 |466 | 99 
Music 48.5 8.9 25 49.0 9.2 
Art 47.9 | 93 | 80 | 48,7] 99 
Wen 514 9.8 -79 53.5 | 10.0 
Mdn test-retest r 77 
N28. 
bN =175. 
‘N = 159, 
4N = 202, 
sN = 178, 
tN = 137, 


3. If one assumes that a 3-point shift—which 
is roughly one-third SD—is the smallest dif- 
ference worth attending to, the following shifts 
occurred over time within this sample: 


GAINS 
Teaching 8 points 
Nature 7 points 
Music 4 points 
Writing 4 points 
Art 3 points 
Business Management 3 points 
Technical Supervision 3 points 

LOSSES 
Adventure 6 points 
Military Activities 3 points 


“Middle-aging intellectual responsibility” is 
the apparent trend. 

4. Their scores on the remaining scales over 
this 19-yr. period did not shift even this mild 
amount. Interests in areas such as Public 
Speaking, Mathematics, Science, and Religious 
Activities were, as measured by these scales, 
surprisingly constant. 

5. The test-retest correlations reported at the 
bottom of Table 33 provide a measure of the 


stability of the rank order of the individuals 
within the total sample. The most intriguing 
finding here is that the stability over the 
10-yr. adult span, 1939-1949, was equal to 
the 1-yr. freshmen-sophomore period—further 
support for the familiar finding that interests 
are remarkably stable after age 25. 


Similar results for another group of college 
students are reported in the following section 
on validity. 


VALIDITY 


There is increasing awareness among psy- 
chologists that there is no single way of €s- 
tablishing the validity of any test or inventory 
as no one index can adequately represent the 
relationship between test behavior and real- 
world behavior. Even in those situations where 
the test is designed for a specific purpose, for 
example, the Medical School Aptitude Test, it 
is not clear which aspect of medical school per 
formance should be used as the criterion. For 
measures of interests, the situation is even more 
confusing as there are no criteria available that 
have even the relative clarity of medical schoo 
accomplishments. How can the validity of, says 
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TABLE M 


Baxr Descxtrrion or tae (Nc ral Saurus Wense Hasic Scare 
Maw N Arrzas ix Tastes 15-56 


Accountant 


Advertising Men 
Air Force Officer 


Animal Husbandry Prof, 


Junior accountants 


Alr Force officers training at Wright Patter. 
tom Air Force Base 

University faculty members 

University faculty 

Above average officers, mostly field grade, all 
West Point graduates 

Architects, mosily from California 


Astronauts, mostly military officers 
Roughly r journalists, 40% authors 


All listed in American Men of Science 


CPA (44) Senior Certified Public Accountants 
CPA (65) Minnesota Certified Public Accountants 
Cham Comm Exec Chamber of Commerce and Trade Asocia 
tion executives 
Chemist Industrial chemists 
Comm Rec Director Community recreation directors 
. l. iii Corporation 
350 presidents 
Corp Pres (65) Corporation presidents 
Credit Manager 
Danforth Fellow Danforth Foundation Fellows—aspiring col- 
lege professors 
Dentist 
Economist University faculty members 
_ Elem Teacher Minnesota elementary teachers 
Engineer 
Farmer (36) Farmers, mostly from Pacific Northwest 
Farmer (67) South Dakota farmers 
Forest Service Men 
Funeral Director 
Generals and Admirals 3- and 4-Star Army Generals and Navy 
Governor Retired state governors 
Guidance Counselor 
Interior Decorator Members, American Institute of Interior 
Design 
Judge Federal District Judges 
Lawyer (27) California lawyers 
Lawyer (40) Minnesota lawyers 
Legislator Senators and Representatives from Minne- 
sota Legislature 
Librarian 
Mathematici 
Math-Science Teacher High school mathematics-science teachers 
McKnight Fellow McKnight Fellows—actors in Tyrone Guth- 
4 rie Theater in Minneapolis 
Minister (27) 


*CIMR = Center for Interest Measurement Research, University of Minnesota. 
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TABLE 34— (Continued) 


Sample N Rie Investigator 
Minister (65) 96 | 1965 | CIMR 
Mpls. Symphony 53 | 1966 | CIMR 
Musician 450 | 1952 | Strong 
Music Teacher 493 | 1952 | Strong 
NIAL Member 42 | 1966 | CIMR 
Navy Officer 645 | 1965 | Stephenson & Abrahams 
Newsmen 283 | 1967 | CIMR 
Office Worker 326 | 1928 | Strong 
Optometrist 405 | 1963 | Acree 
Osteopath 585 | 1939 | Strong 
Personnel Director 147 | 1927 | Strong 
Petroleum Engineer 385 | 1965 | Alford 
Pharmacist 309 | 1947 | Schwebel 
Photographer 253 | 1967 | CIMR 
Physical Therapist 350 | 1957 | Strong 
Physician (27) 175 | 1927 | Strong 
Physician (’49) 534 | 1949 | Strong & Tucker 
Internist 209 | 1949 | Strong & Tucker 
Neurological Surgeon 47 | 1949 | Strong & Tucker 
Orthopedic Surgeon 71 | 1949 | Strong & Tucker 
Pathologist 154 | 1949 | Strong & Tucker 
Pediatrician 96 | 1949 | Strong & Tucker 
Physiatrist 460 | 1966 | Athelstan 
Psychiatrist 404 | 1949 | Strong & Tucker 
Radiologist 111 | 1949 | Strong & Tucker 
Surgeon 188 | 1949 | Strong & Tucker 
Urologist 84 | 1949 | Strong & Tucker 
Physicist (27) 192 | 1929 | Strong 
Physicist (67) 64 1967 | CIMR 
Pilot 510 | 1941 | Strong 
Policemen 254 | 1933 | Strong 
Political Scientist 177 | 1967 Rossmann & Lips 
Priest 234 | 1966 | Lepak 
Printer 270 | 1936 | Strong 
Production Manager 216 | 1935 | Strong 
Psychologist (31) 174 | 1931 | Strong 
Psychologist (’47) 149 | 1947 | Kelly 
Psychologist (’49) 1045 | 1949 | Kriedt 
Psychologist (Exp) 256 | 1949 | Kriedt 
Public Admin 550 | 1941 | Strong 


Brief description 


Members, Minneapolis Symphony Orchestra 
Mainly performing musicians 


Members, National Institute of Arts and 
Letters 

Above average naval officers 

Newspaper reporters and television news 
broadcasters 


Members, American Optometric Association 


Texas petroleum engineers 

New York pharmacists 

Mostly news photographers and photo- 
journalists 


Random sample of the American Medical 
Association » 
Representative sample of those boarded in 
designated specialty 
Representative sample of those boarded in 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 


of those boarded in 
of those boarded in 
of those boarded in 
of those boarded in 
of those boarded in 
of those boarded in 


of those boarded in 


Strong's original aviator group 
Police officers from several cities 
University faculty members 
Minnesota priests 


Veterans Administration clinical psycholo 
gists 3 

Ninety percent of APA membership 2 0 

Experimental psychologists, subset of 
total 1949 group i 

Public administrators from a varie 
lic agencies 


ty of pub- 


— n — . . —————ð&⁴.—Q . ³·³³ 3 ð ⁰ AAA I ̃ ̃ ⅛ ·—3x— gü. 
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TABLE 34—(Continued) 


| | 
Sample N . Investigator Brief description 

Pulitzer Prize 84 1965 | CIMR Pulitzer Prize winners, mostly journalists and 
novelists 

Purch Agent 219 | 1931 | Strong | Purchasing agents 

Rehab Counselor 272 | 1949 | Acree | Vocational rehabilitation counselors from the 
Veterans Administration 

Salesmen, Auto 116 | 1929 | Strong Automobile salesmen 

Salesmen, Computer 190 | 1964 | Stein Control Data Corporation computer sales- 
men 

Salesmen, Encyclopedia 49 | 1966 | CIMR | Outstanding World Book encyclopedia sales- 
men 

Salesmen, Life (31) 310 | 1931 | Strong 

Salesmen, Life (66) 76 | 1966 | CIMR $2,000,000 life insurance salesmen 

Salesmen, 3M 100 | c1960 | Kirchner Salesmen on staff of 3M 

Salesmen, 3M applicants | 100 | c1960 | Kirchner Men applying for sales positions at 3M 
company 

Salesmen, PG&E 179 | 1939 | Strong Appliance salesmen, Pacific Gas & Electric 
company 

Salesmen, Real Estate 243 | 1932 | Strong 

Salesmen, Steel 61 | 1966 | Berdie Structural steel salesmen 

Sales Manager 228 | 1932 | Strong 

School Super (30) 190 | 1930 | Strong School superintendents 

School Super (65) 153 | 1965 | CIMR School superintendents 

Soc Sci Teacher 217 | 1936 | Strong High school social science teachers 

Social Worker (’53) 400 | 1953 | McCornack 

Social Worker (67) 54 | 1967 | CIMR 

Sociologist 198 | 1965 | Rossmann & Lips University faculty members 

Student Personnel 192 | 1961 | Clark Members, American College Personnel Asso- 
ciation 

Unitarian Minister 113 | 1950 | Strong 

Vet (40) 310 | 1949 | Hannum Towa veterinarians 

Vet (66) 478 | 1966 | Hannum & Alsip Towa veterinarians 

YMCA PD 216 | 1927 | Strong YMCA phyiscal director 

YMCA Sec 113 | 1927 | Strong 

YMCA Staff 184 | 1961 | Seashore YMCA staff (comparable to combination of 
original physical directors and secretaries) 


a measure of interests in Religious Activities 
be studied? 

The establishment of validity for such mea- 
sures, it is the authors’ opinion, depends on 
the collection and organization of a vast 
amount of data covering several aspects of the 
inventory. All psychological tests and inven- 
tories have some potential validity—for 
persons do not give random answers—but until 
the relationships between the test behavior 
and an individual’s actions are thoroughly 
Studied, using many diverse approaches, the 
value of the test remains obscure. 

With this orientation, a substantial amount 
of information is presented here, showing that 
responses to these SVIB scales are related to 
the occupation that the individual chooses. 
The concept of occupational membership was 


used here because it has proven valuable in 
earlier work with the regular SVIB scales, 
because it is objective, because—as E. K. 
Strong frequently commented—hardly anyone 
finds fault with it, and because it is closely re- 
lated to the intended purpose of the SVIB, 
that is, as an aid in vocational counseling. 
The information that follows has been 
organized in three traditional areas: content, 
concurrent, and predictive validity. 


Content Validity 


Content validity refers to the actual content 
of the scales. For these basic scales, a certain 
amount of content validity is assured as each 
item has survived both a statistical and a 
common sense” screening to determine that, 
within each scale, the items are related. Con- 
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TABLE 35 
MALE Occupations MEAN SCORES ON THE ADVENTURE SCALE 
= — 
M stand- Sample 
ard score 
70 
69 
68 
67 
66 Astronaut 
65 
64 
63 
62 
61 
60 Salesmen, 3M applicants 
59 Salesmen, 3M/ Policemen/ Salesmen, Computer 
58 Air Force Officer/ Navy Officer 
57 Salesmen, Steel/ Pilot 
56 McKnight Fellow/ YMCA Staff/ Petroleum Engineer 
55 Photographer / Elem Teacher / Comm Rec Director / Computer Programmer 
54 CPA 65/ Army Officer/ Legislator/ Newsmen/ Salesmen, Life 66/ Social Worker 67 
53 Cham Comm Exec/ Salesmen, Auto/ Forest Service Men/ Physical Therapist 
52 Vet 66/ YMCA PD/ Optometrist/ Minister 65/ Orthopedic Surgeon 
51 Salesmen, Encyclopedia/ Generals and Admirals/ Danforth Fellow/ CPA 44/ Corp Pres 65/ Neuro- 
logical Surgeon 
50 Priest/ Sociologist/ Credit Manager/ Salesmen, PG&E/ Printer/ Carpenter/ School Super 65 
49 Radiologist/ Urologist/ Farmer 36/ Surgeon/ Mpls Symphony/ Osteopath/ Math-Science Teacher/ 
Student Personnel 
48 Psychologist 47/ Salesmen, Real Estate/ Public Admin/ Pediatrician/ Chemist/ Biologist/ Banker 
64/ Physiatrist/ Bus Ed Teacher/ Farmer 67/ Musician/ Physicist 67/ Anthropologist 
47 Accountant/ Social Worker 53/ Psychologist 31/ Office Worker/ Unitarian Minister/ Economist/ 
Production Manager/ Psychologist (Exp)/ Personnel Director/ Funeral Director/ Physician 49/ 
Psychiatrist/ Animal Husbandry Prof / Artist / Vet 49 
46 Pharmacist/ Soc Sci Teacher/ Physician 27/ Engineer/ Purch Agent/ Political Scientist/ Sales Man- 
ager/ Librarian/ Dentist/ Lawyer 49/ Lawyer 27/ Pathologist/ Authors/ Interior Decorator/ 
Governor/ Advertising Men 
45 Rehab Counselor/ Architect/ YMCA Sec/ Salesmen, Life 31/ Guidance Counselor/ Pulitzer Prize/ 
Minister 27/ Psychologist 49/ Music Teacher/ Internist 
44 Judge 
43 Banker 34/ Corp Pres 35/ Physicist 27 
42 
41 School Super 30 
40 Mathematician/ NIAL Member 
39 
38 
37 
36 
35 


Note.—In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34), Thus numbers 
following samples indicate appropriate year, 


sequently one has some confidence that when 
a person answers, for example, Like or Dislike 
to the items, Algebra, Arithmetic, Calculus, 
Geometry, Mathematics, and Physics, he is 
telling something about his interests in 
mathematics. 

However, although content validity is quite 
helpful in scale interpretation, by itself it 


reveals nothing about the relationship betwee? 
an individual’s interests and his actual be 
havior. For such information, one needs t° 
turn to concurrent and predictive validity. 


Concurrent Validity 


Concurrent validity refers to the power of 
the test to distinguish between specified groups 
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TABLE 36 
MALE OCCUPATIONS MEAN SCORES ON THE AGRICULTURE SCALE 


M stand- 
ard score Sample 

70 

69 

68 

67 

66 

65 

64 

63 Farmer 67 

62 

61 Farmer 36/ Vet 49 

60 Vet 66 

59 Forest Service Men/ Animal Husbandry Prof 

58 

57 

56 

55 

54 Pilot 

53 Army Officer/ Generals and Admirals 

52 Orthopedic Surgeon 

51 Salesmen, Encyclopedia/ Policemen/ Comm Rec Director/ Physical Therapist/ Astronaut/ Governor 

50 YMCA PD/ Salesmen, PG&E/ Biolegist/ Urologist/ Legislator/ Osteopath 


49 Public Admin/ Radiologist/ Carpenter/ Surgeon/ Salesmen, Life 66/ Petroleum Engineer / Neuro- 
logical Surgeon 

48 Salesmen, Real Estate/ YMCA Sec/ Salesmen, Steel/ Salesmen, Life 31/ YMCA Staff / Dentist/ 
Banker 34/ Minister 27/ Salesmen, Computer/ Funeral Director/ Air Force Officer/ Navy Officer/ 
Physiatrist 

47 Salesmen, 3M applicants/ Salesmen, 3M/ Soc Sci Teacher/ Pediatrician/ Physician 27/ Credit Man- 
ager/ Photographer/ Elem Teacher/ Sales Manager/ CPA 44/ Banker 64/ Corp Pres 65/ Physician 
49/ Artist 

46 Chemist/ Engineer/ School Super 30/ Unitarian Minister/ Production Manager/ Personnel Director/ 
Salesmen, Auto/ Corp Pres 35/ Bus Ed Teacher/ Pathologist/ Math-Science Teacher 

45 Rehab Counselor/ Priest/ Cham Comm Exec/ Architect/ Social Worker 53/ Office Worker/ Printer/ 
Guidance Counselor/ McKnight Fellow/ Optometrist/ Computer Programmer/ Lawyer 49/ School 
Super 65/ Psychiatrist/ Social Worker 67/ Music Teacher/ Musician/ Internist/ Anthropologist 
44 Pharmacist/ Accountant/ Psychologist (Exp)/ Purch Agent/ Lawyer 27/ Newsmen/ Minister 65/ 
Advertising Men/ Physicist 27/ Psychologist 49/ Authors 

43 Psychologist 31/ Mpls Symphony/ Danforth Fellow/ Physicist 67/ Interior Decorator/ NIAL Mem- 
ber/ Student Personnel 5 a xs 

42 Psychologist 47/ Sociologist/ CPA 65/ Librarian/ Pulitzer Prize/ Judge/ Mathematician 

41 Economist 

40 Political Scientist 

39 
38 
37 
36 
35 


Note. In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


at the current moment. For example, can these The relevant data for concurrent validity 


scales discriminate between the interests of are presented in Tables 34 through 56. For 
men who are currently salesmen or scientists? each of the basic scales, the means of 113 occu- 
This is in contrast to predictive validity where pational samples are rank ordered, thus per- 
the issue is whether the scales can identify dif- mitting at a glance an evaluation of the power 


ferences between boys who will become sci- of the scale to separate occupations. 
elitists or salesmen These occupational samples represent the 


CAMPBELL, BORGEN, Eastes, JOHANNSON, AND PETERSON 


TABLE 37 
MALE OCCUPATIONS MEAN SCORES ON THE ART SCALE 


ard score 5 
69 
68 
67 
66 
65 Interior Decorator 
6 Architect/ McKnight Fellow 
63 Artist 
62 
61 
60 Photographer 
59 Mpls Symphony/ NIAL Member 
58 Pulitzer Prize 
57 Unitarian Minister/ Librarian/ Danforth Fellow/ Musician 
56 Advertising Men/ Physiatrist/ Music Teacher/ Anthropologist/ Neurological Surgeon 
55 Psychologist 47/ Psychologist 31/ Elem Teacher/ Minister 27/ Minister 65/ Orthopedic Surgeo 
Authors 
54 Priest/ Sociologist/ Social Worker 53/ Pediatrician/ Biologist/ Economist/ Psychologist (Exp) 
Dentist/ Newsmen/ Psychiatrist 
83 Salesmen, 3M/ YMCA Sec / Urologist/ Personnel Director / Political Scientist / Judge / Ph 
Therapist / Psychologist/ 49/ Social Worker 677 Physicist 67/ Pathologist 
52 Salesmen, 3M applicants/ YMCA PD/ Radiologist/ Physician 27/ Salesmen, PG&E/ Carpenter, 
Surgeon/ Salesmen, Encyclopedia/ Comm Rec Director/ YMCA Staff / Computer Programmer 
Generals and Admirals / Physicist 27/ Physician 49/ Internist / Student Personnel 
5¹ Rehab Counselot / Cham Comm Exec / Public Admin / Office Worker / Engineer / Printer / Corp P 
35/ Optometrist / Bus Ed Teacher / Osteopath / Astronaut / Mathematician 
50 Accountant/ Soc Sci Teacher/ Chemist/ Credit Manager/ Salesmen, Life 31/ School Super 30 
Guidance Counselor/ Funeral Direc/ Policemen/ Army Officer / Legislator/ School Super 65/ Salesme 
Life 66/ Math-Science Teacher 
49 Salesmen, Real Estate/ Salesmen, Steel / Air Force Officer/ Production Managers/ Salesmen, Auto) 
Korat Service Men / Pilot / Sales Manager / Salesmen, Computer/ APA 44/ Navy Officer / Lawyer 2 
vernor 
48 Pharmacist/ Farmer 36/ Purch Agent/ Lawyer 49/ Animal Husbandry Prof/ Petroleum Engineer 
47 Vet 66/ CPA 65/ Banker 34/ Banker 64/ Corp Pres 65 
46 Vet 49 
45 Farmer 67 
44 
43 
42 
41 
40 
39 
38 
37 
36 
35 


Note.—In some cases occupational groups were sampled and tested in di 4). Thus numbers 
following samples indicate appropriate year. ed in different years (e. g., Banker, see Table 34). 


majority of adult men who have been tested 
for research purposes with the SVIB since it 
was first published in 1927. Further informa- 
tion on each sample is listed in Table 57. 
Most of these occupational samples include 
men with at least 3 yr. of experience who say 


they like their work. More extensive info ma 
tion on many of the samples can be found” 
the SVIB manual (Campbell, 1966). f 

One large block of these samples came from 
Strong’s files, including those used to build tht 


E i 
occupational scales, another block came from 
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TABLE & 


Mare Orcuranoxs Maas Sopars on mue Posixrss Manacewenr Sears 


M stand- | 
ard score | 


62 Salesmen, 3M applicant 
ól Salesmen, 3M/ Credit Manager 


nea — — — === 


Sample 


60 Cham Comm Exec/ Sales, Encyclopedia / Bus Ed Teacher 
59 YMCA Sec/ Personnel Director/ Salesmen, Computer 


58 Salesmen, Steel/ Comm Rec Director 


57 Rehab Counselor/ Accountant/ Office Worker/ Salesmen, PB&E/ Sales Manager/ YMCA Stall/ 


School Super 65 


56 Guidance Counselor / Purch Agent/ Legislator 
55 Public Admin/ Corp Pres 35/ Funeral Director/ CPA 65/ Production Manager/ Army Officer/ 


CPA 44 


s4 Soc Sci Teacher / Social Worker 53/ Salesmen, Life 31/ Air Force Officer/ Salesmen, Auto/ Navy 


Officer/ Banker 64/ Governor 


53 Salesmen, Real Estate / School Super 30/ Banker 34/ Generals and Admirals/ Corp Pres 68/ Saleemen, 


Life 66 


Astronaut 


Vet 49 


Surgeon/ 
27/ Newsmen/ Psychiatrist/ Farmer 67/ Osteopath/ 


45 Sociologist/ Radiologist/ Judge/ Animal Husbandry Prof/ Musician 
44 Psychologist 31/ Surgeon/ Political Scientist/ Dentist/ Neurological Surgeon 
43 Architect/ Psychologist (Exp)/ Physician 49/ Internist 


42 Biologist/ Danforth Fellow 


41 | Physician 27/ Photographer / Mpls Symphony/ Pathologist 
40 | McKnight Fellow/ Physicist 27/ Physicist 67/ Mathematician/ Authors 


39 Anthropologist 
38 Pulitzer Prize 


36 Artist/ NIAL Member 


Note. In some cases occupational groups were sampled and tested in different years (e.g., Banker. see Table 34). Thus numbers 


following samples indicate appropriate year. 


the work of the present authors’ research in- 
stitute at Minnesota, and the remainder came 
from various other investigators working with 
the SVIB. Practically everyone in the latter 
group who was asked for his data was co- 
operative—if the information had not been 
lost or discarded—and the authors would like 
to publicly acknowledge gratitude for their 
help. 


The mean scores in Tables 35 through 56 are 
reassuring. Occupations which should score 
high did so; military officers scored highest on 
the Military Activities scale, ministers scored 
highest on the Religious Activities scale, sci- 
entists scored highest on the Science scale and 
so on—there were virtually no surprises. In 
fact, the results are so straightforward as to 
lead one to that conclusion voiced frequently 
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TA 


S, JOHANNSON, AND PETERSON 


BLE 39 


Mate Occurations MEAN Scores on THE Law/Pourrics SCALE 


M stand- 
ard score 


Sample 


70 
69 


62 Judge/ Legislator 

61 Salesmen, 3M applicants/ Salesmen, 3M/ 
60 Cham Comm Exec 

59 Laywer 49 


Political Scientist/ Salesmen, Computer/ Governor 


58 Social Worker 53/ Credit Manager/ CPA 65/ Economist/ Navy Officer/ Lawyer 27/ Newsmen/ 


Salesmen, Life 66/ Student Personnel 
57 Rehab Counselor / Unitarian Minister / Ai 


r Force Officer / Salesmen, Encyclopedia / Comm Rec Di- 


rector / Army Officer / School Super 65 / Social Worker 67 


56 Priest/ Sociologist/ Salesmen, Steel/ Guidance Counselor / Personnel Director/ YMCA Staff / Gen- 


erals and Admirals / Bus Ed Teacher / Dan 
55 Public Admin / Social Sci Teacher / School 
54 


forth Fellow 
Super 30/ Elem Teacher / Minister 65 / Astronauts 


Psychologist 47 Policemen / Optometrist / Computer Programmer / Corp Pres 65 / Petroleum Engi- 


neer 


53 YMCA Sec/ Salesmen, PG&E/ McKnight Fellow/ CPA 44/ Physiatrist/ Physical Therapist/ Psy- 


chiatrist/ Psychologist 49 


52 Accountant/ Pediatrician/ Office Worker/ Salesmen, Life 31/ Salesmen, Auto/ Librarian/ Banker 34/ 


Prize/ Banker 64 


Minister 27/ Orthopedic Surgeon / An thropologist/ Neurological Surgeon £ 
51 Vet 66/ Salesmen, Real Estate/ Psychologist 31/ Radiologist/ Urologist/ Psychologist (Exp)/ Pulitzer 


50 Printer/ Production Manager/ Purch Agent/ Pilot/ Sales Manager 


49 Pharmacist/ YMCA PD/ Corp Pres 35/ 


Biologist/ Photographer/ Surgeon/ Forest Service Men/ 


Animal Husbandry Professor/ Music Teacher/ Osteopath/ Internist/ Math-Science Teacher 


48 Funeral Director/ Mpls Symphony/ Advertising Men/ Physician 49/ Musician / Physicist 67/ Authors 


47 Engineer/ Farmer 36/ Dentist/ Pathologis 


t / Vet 49 


46 Physician 27/ Chemist/ Carpenter/ Mathematician/ Interior Decorator/ NIAL Member 


45 Architect/ Farmer 67 
44 Physicist 27 


42 Artist 


Note.—In some cases occupational groups were sampled and 


following samples indicate appropriate year. 


by students in introductory psychology 
courses: “You had to do all that research just 
to find that out?” 

Clearly, scores on these scales are related, in 
a highly significant fashion, to the occupations 
of adult men. Anyone who would understand 
these scales thoroughly must spend some time 


studying these tables of means, both to learn 
more about the relative levels of scores on each 
scale, and to learn more about each occupation. 

These rank-ordered means provide a firmer 
foundation for ascertaining the importance of 
scores at various levels. Scanning these tables 


| 

| 
tested in different years (e. g., Banker, see Table 34). Thus numbers 
suggests that scores above 58 or 59 are high | 
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Psychologist 31/ Danforth Fellow/ Physiatrist/ Math-Science Teacher 

Accountant/ Pediatrician/ School Super 30/ Biologist/ Urologist/ Production Manager/ Elem 
Teacher/ Pilot/ School Super 65/ Psychologist 49/ Pathologist/ Orthopedic Surgeon/ Neurological 
Surgeon 

52 Architect / Radiologist / Credit Manager / Salesmen, Steel / Corp Pres 35/ Forest Service Men / Banker 
64 / Psychiatrist 

51 Salesmen, 3M/ Public Admin / Office Workers / Personnel Director / Surgeon / Sales Manager / Lawyer 
49/ Physician 49 / Internist 

50 Psychologist 47/ Vet 66/ Pharmacist/ Sociologist/ Salesmen, PG&E/ Guidance Counselor/ Carpen- 
ter/ Farmer 36/ Purch Agent/ Judge/ Legislator/ Physical Therapist/ Bus Ed Teacher/ Student Per- 
sonnel 

49 Rehab Counselor/ Physician 27/ Salesmen, Auto/ Banker 34/ Lawyer 27/ NIAL Member/ Governor 
48 Salesmen, 3M applicant/ YMCA PD/ Policemen/ Political Scientist/ Mpls Symphony/ Comm Rec 
Director/ Dentist/ Minister 27/ Osteopath/ Anthropologist/ Vet 49 

47 Cham Comm Exec/ YMCA Sec / Salesmen, Life 31/ Printer / Unitarian Minister/ Funeral Director/ 
Salesmen, Encyclopedia 

46 Priest/ Salesmen, Real Estate/ Photographer/ McKnight Fellow/ Librarian/ YMCA Staff/ Minister 
65 / Salesmen, Life 66/ Music Teacher/ Musician 

45 Soc Sci Teacher/ Social Worker 53/ Social Worker 67/ Farmer 67/ Advertising Men 


37 

56 

55 Psychologist (Exp)/ Optometrist/ CPA 44/ Navy Officer/ Animal Husbandry Prof 
54 

53 


TABLE 40 
. MALE OCCUPATIONS MEAN SCORES ON THE MATIRMATICS SCALE 
M stand- Sample 
ard score 
70 
69 
68 
67 
66 
65 
64 | 
63 
62 
61 Astronauts 
60 Physicist 27/ Physicist 67/ Mathematician 
59 
58 Air Force Officer / Computer Programmer / Salesmen, Computer / Petroleum Engineer 
57 Engineer / Army Officer 
Chemist / CPA 65 / Economist / Generals and Admirals / Corp Pres 65 


43 Newsmen 
42 Pulitzer Prize/ Artist/ Interior Decorator 


40 Authors 


Note.—In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


enough to be important from an occupational this interpretation will obviously be necessary, 
standpoint; scores below 42 or 43 indicate im- especially with teenagers and especially with 
portant areas of rejection, at least among the Adventure scale where one knows high 
adults. Thus, the present authors’ recommen- scores will likely decrease over time, but these 
dation is that scores above 58 be considered figures can be used as initial landmarks. h 

high, those below 42, low. Some sharpening of Though these data are positive and meaning- 
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TABLE 41 
Mate Occupations MEAN ScorEs ON THE MECHANICAL SCALE 


M stand- 
ard score 


Sample 


70 
69 
68 
67 


5 
56 Air Force Officer/ Carpenter/ Orthopedic Surgeon 


54 Radiologist/ Pilot/ Army Officer / Farmer 67/ Neurological Surgeon 

53 Engineer/ Urologist/ Physicist 27/ Petroleum Engineer 

52 Psychologist (Exp)/ Computer Programmer/ Physical Therapist/ Astronaut/ Math-Science Teacher 
51 Chemist/ Printer/ Production Manager/ Surgeon/ Forest Service Men/ Policemen/ Navy Officer/ 


Physiatrist/ Physicist 67 


50 Pediatrician/ Salesmen, PG&E/ Farmer 36/ Optometrist/ Salesmen, Computer/ Generals and 


Admirals/ Pathologist 


49 Biologist/ Dentist/ CPA 44/ Psychiatrist/ Osteopath 
48 Architect/ YMCA PD/ Psychologist 31/ Credit Manager/ Personnel Director/ Elem Teacher/ 


Physician 49/ Psychologist 49 


47 Vet 66/ Public Admin/ Accountant/ Salesmen, Auto/ Purch Agent/ Comm Rec Director/ Minister 


Vet 49 


27/ Banker 64/ Corp Pres 65/ Animal Husbandry Prof/ Bus Ed Teacher/ Internist/ Mathematician/ 


46 Salesmen, 3M applicants/ Salesmen, 3M/ YMCA Sec/ Physician 27/ Salesmen, Steel/ Office Worker/ 


Unitarian Minister/ Guidance Counselor/ Corp Pres 35/ Mpls Symphony/ YMCA Staff / Musician/ 


45 Psychologist 47/ Rehab Counselor/ Pharmacist/ Social Worker 53/ CPA 65/ Photographer/ Social 


Worker 67 


44 Priest/ Cham Comm Exec/ Soc Sci Teacher 
Legislator/ Banker 34/ Minister 65/ School Su 
43 Salesmen, Real Estate/ Sociologist / Economis 


42 Danforth Fellow 


41 Salesmen, Encyclopedia/ McKnight Fellow. 


/ School Super 30/ Funeral Director/ Sales Manager/ 
per 65/ Music Teacher / Artist 
t / Librarian / Anthropologist/ Student Personnel 


/ Lawyer 49/ Advertising Men/ Governor 


40 Salesmen, Life 31/ Lawyer 27/ Interior Decorator 


39 Judge/ Newsmen 


38 Political Scientist/ Pulitzer Prize/ Salesmen, Life 66/ Authors/ NIAL Member 


Note.—In some cases occupational groups were 
following samples indicate appropriate year. 


ful, one disappointing aspect—at least for the 
senior author—was the magnitude of differ- 
ences between extreme groups. On most of the 
scales, the range from the highest mean to the 
lowest was about 20 standard score points, or 
2 SDs. While that is a substantial separation, 
especially when viewed against results from 


sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 


other tests (if reported in percentiles, that 
would be the difference between the fifteenth 
and eighty-fifth percentiles), still it is less than 
the 3 or 4 SDs found between extreme scores 
on the regular SVIB occupational scales. To 
achieve purity of content, and thus easier 
interpretation, empirical validity has suffered. 
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TABLE 42 
MALE Occupations MEAN SCORES ON THE MEDICAL SERVICE SCALE 
M stand- 
ard score Sample 
70 
69 
68 
67 
66 
65 
64 Dentist/ Physical Therapist 
63 Vet 66/ Urologist/ Physiatrist/ Osteopath/ Vet 49 
62 Pharmacist/ Radiologist/ Pediatrician/ Surgeon/ Physician 49/ Neurological Surgeon 
61 Physician 27 


60 Optometrist/ Psychiatrist/ Pathologist/ Internist/ Orthopedic Surgeon 


59 YMCA PD/ Animal Husbandry Prof 
58 Biologist 
57 Psychologist 47/ Math-Science Teacher 


56 Psychologist (Exp) / Comm Rec Director/ YMCA Staff 


55 Psychologist 31/ Psychologist 49 


54 Salesmen, 3M applicants/ Rehab Counselor/ Priest/ Social Worker 53/ Chemist/ Elem Teacher/ 
Policemen/ Minister 27/ Social Worker 67/ Astronaut 
53 YMCA Sec / Unitarian Minister / Guidance Counselor/ Funeral Director/ Salesmen, Computer/ 


School Super 65/ Student Personnel 


52 Salesmen, 3M/ Soc Sci Teacher/ Credit Manager/ Air Force Officer/ Mpls Symphony/ Army Officer/ 


neer 


Computer Programmer/ Legislator/ Danforth Fellow/ Minister 65/ Salesmen, Life 66/ Petroleum Engi- 


51 Cham Comm Exec/ Public Admin/ Salesmen, PG&E/ Forest Service Men/ Pilot/ Navy Officer/ 


Musician/ Governor 


50 Sociologist/ Salesmen Steel/ Office Worker/ School Super 30/ Personnel Director/ Farmer 36/ Sales- 


men, Auto / Librarian / CPA 44/ Music Teacher / Bus Ed Teacher/ Anthropologist 


49 Salesmen, Life 31/ Printer/ CPA 65/ Production Manager/ Photographer/ McKnight Fellow/ Gen- 


erals and Admirals/ Physicist 67 


48 Accountant/ Economist/ Carpenter/ Salesmen, Encyclopedia/ Sales Manager/ Pulitzer Prize/ 


Judge/ Lawyer 49/ Newsmen/ Banker 64/ Physicist 27/ NIAL Member 


47 Salesmen, Real Estate / Engineer / Corp Pres 35/ Purch Agent / Banker 34 / Lawyer 27 / Corp Pres 65/ 


Artist 


46 Architect/ Political Scientist/ Advertising Men/ Mathematician/ Author/ Interior Decorator 


45 Farmer 67 


‘ote.—In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 


N. 
following samples indicate appropriate year. 


Predictive Validity 


The predictive validity of these scales can be 
demonstrated by reanalyzing some SVIBs 
collected by Berdie (1960) and Schletzer 
(1963). Berdie identified students who gradu- 
ated from the University of Minnesota in cur- 


ricula that are highly predictive of eventual 
occupations, specifically, accounting, dentistry, 
journalism (which is less predictive than the 
others), law, mechanical engineering, and 
medicine. Each of these students had com- 
pleted the SVIB as high school seniors, and 
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TABLE 43 
MALE OCCUPATIONS MEAN SCORES ON THE MERCHANDISING SCALE 


M stand- 
ard score 


Sample 


70 
69 
68 
67 
66 
65 
64 
63 
62 Salesmen, 3M 


61 Salesmen, 3M applicants/ Bus Ed Teacher 


60 
59 Credit Manager 


58 Cham Comm Exec/ Salesmen, PG&E/ Salesmen, Encyclopedia 

57 Salesmen, Steel/ Office Worker/ Salesmen, Computer 

56 Pharmacist/ YMCA Sec/ Funeral Director/ Salesmen, Auto/ Purch Agent 

55 Comm Rec Director 

54 Rehab Counselor/ Accountant/ Soc Sci Teacher/ Guidance Counselor/ CPA 65/ Sales Manager/ 
YMCA Staff/ Interior Decorator 

53 Salesmen, Real Estate/ Air Force Officer/ Legislator/ School Super 65 f 

52 Salesmen, Life 31/ Personnel Director/ Elem Teacher/ Army Officer/ CPA 44/ N. avy Officer / Banker 
64/ Salesmen, Life 66 

51 YMCA PD / Corp Pres 35/ Optometrist / Banker 34/ Petroleum Engineer / Math- Science Teacher 

50 Vet 66/ Public Admin / Social Worker 53/ Advertising Men 

49 


Music Teacher 


Production Manager/ Librarian/ Computer Programmer/ Generals and Admirals/ Physical Therapist/ 


48 Priest/ School Super 30/ Printer/ Urologist/ Policemen/ Lawyer 49/ Minister 65/ Corp Pres 65/ 


Physiatrist/ Student Personnel/ Governor 


45 Sociologist/ Chemist/ Economist/ Dentist/ Psychiatrist/ Psychologist 49/ Neurological Surgeon 
44 | Judge/ Psychologist 31/ Engineer/ Surgeon/ Mpls Symphony/ Lawyer 27/ Newsmen/ Physician 49/ 


Internist 
43 Architect/ Biologist 


42 Physician 27/ Psychologist (Exp)/ McKnight Fellow/ Political Scientist 


41 Photographer/ Pathologist/ Mathematician 


40 Danforth Fellow/ Physicist 67 
39 Physicist 27/ Authors/ Anthropologist 
38 Pulitzer Prize/ Artist 


36 NIAL Member 


Note. In some cases occupational 
following samples indicate appropriate year, 


Berdie’s report indicates that there were sub- 
stantial relationships between their high school 
SVIB profile and the curriculum they selected. 
Schletzer, for her doctoral dissertation, lo- 
cated and retested these same students approxi- 
mately 4 yr. after they graduated from the 
University of Minnesota, roughly 8-10 yr. after 
the initial testing. Although these groups were 
small, approximately 30 in each of the six 


1 groups were sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 


curricula, the results are very important as no 
other occupational samples have been tested 
before college and retested several years later 
after entry into the occupation. 

The relevant data are presented in Tables 57 
and 58. In Table 57 are listed, for the total 
Sroup, test and retest means, standard devia- 
tions, and test-retest correlations. For each of 
the occupational subgroups, the test and retest 


X 


M stand- 
ard score 
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TABLE 44 
MALE Occupations MEAN Scores ON THE MILITARY SCALE 


Sample 


Army Officer/ Generals and Admirals 


Policemen 
Air Force Officer 


YMCA PD/ Pilot 


Navy Officer/ Astronaut/ Governor 

Comm Rec Director 

Salesmen, 3M applicants/ Rehab Counselor/ Public Admin/ Credit Manager/ Office Worker/ Sales- 
men, PG&E/ YMCA Staff/ Legislator/ Salesmen, Computer/ Physical Therapist 

Salesmen, 3M/ Vet 66/ Cham Comm Exec/ Engineer/ Personnel Director/ Salesmen, Auto/ Optome- 
trist / CPA 44/ Farmer 67/ Osteopath / Vet 49 

Accountant/ Soc Sci Teacher/ Salesmen, Steel/ Production Manager/ Farmer 36/ Purch Agent/ 
Funeral Director/ Lawyer 27/ School Super 65/ Banker 64/ Bus Ed Teacher/ Petroleum Engineer/ 
Math-Science Teacher 

Salesmen, Real Estate/ Pharmacist/ Salesmen, Life 31/ Guidance Counselor/ Carpenter/ Salesmen, 
Encyclopedia/ Elem Teacher/ Forest Service Men/ Sales Manager/ Dentist/ Animal Husbandry 
Prof/ Orthopedic Surgeon/ Advertising Men 

Architect/ YMCA Sec/ Physician 27/ School Super 30/ Printer/ CPA 65/ Urologist/ Banker 34/ 
Lawyer 49/ Physician 49/ Physiatrist/ Salesmen, Life 66/ Music Teacher / Student Personnel 

Judge / Priest / Social Worker 53/ Chemist/ Surgeon/ Computer Programmer/ Corp Pres 35/ News- 
men/ Psychiatrist/ Social Worker 67/ Neurological Surgeon 

Psychologist 31/ Radiologist/ Biologist/ Psychologist (Exp)/ Minister 27/ Pathologist/ Authors 
Psychologist 47/ Librarian / Minister 65/ Physicist 27/ Psychologist 49/ Musician / Internist / Interior 
Decorator 

Pediatrician / Unitarian Minister / Economist / Photographer / Political Scientist / Pulitzer Prize/ 
Artist / Mathematician 

Sociologist / Corp Pres 65 / Physicist 67/ Anthropologist 

Mpls Symphony / Danforth Fellow/ NIAL Member 


McKnight Fellow 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34), Thus numbers 
following samples indicate appropriate year. 


means are presented, and the mean changes 


Table 57, Table 58 was prepared. In this table 


larger than 4 points are identified. Four points, are listed for each subgroup: 


which is four-tenths of a standard deviation, 
is probably the smallest practical difference 


1. Their three highest scores as high school 


worth attending to among these small samples. seniors. 
As an aid to interpretation of the data in 2. Their three highest scores on retest, when 
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TABLE 45 


MALE OCCUPATIONS MEAN SCORES ON THE Music SCALE 


M stand- 
ard score 


Sample 


Music Teacher 
McKnight Fellow/ Musician/ Mpls Symphony 


Danforth Fellow 
NIAL Member 


Librarian/ Minister 65 

Unitarian Minister/ Minister 27/ Physiatrist 

Psychologist 47/ Pediatrician/ Economist/ Photographer/ Political Scientist / Pulitzer Prize 

Priest/ Sociologist/ Social Worker 53/ Elem Teacher/ Newsmen/ Psychiatrist/ Interior Decorator/ 
Anthropologist ; 

YMCA Sec/ Psychologist 31/ Biologist/ Social Worker 67/ Artist/ Pathologist/ Internist 

Radiologist/ Psychologist (Exp)/ Computer Programmer/ Physical Therapist/ Psychologist 49/ 
Physicist 67/ Orthopedic Surgeon/ Neurological Surgeon/ Student Personnel 

Salesmen, 3M applicants/ Cham Comm Exec/ Architect/ Soc Sci Teacher/ YMCA PD/ Credit Man- 
ager/ Printer/ Urologist/ Surgeon/ Optometrist/ Comm Rec Director/ Salesmen, Computer/ School 


Corp Pres 65/ Animal Husbandry Prof/ Astronaut/ Authors 


Physician 27/ Salesmen, Life 31/ Engineer/ Production Manager/ Policemen/ Pilot/ Dentist/ Fu- 
neral Director/ Air Force Officer/ Banker 64 


Vet 66/ Farmer 36/ Purch Agent/ Forest Service Men/ Sales Manager/ Banker 34/ Lawyer 49/ 
Corp Pres 35/ Lawyer 27/ Petroleum Engineer/ Vet 49 
Salesmen, Real Estate/ Governor 


Farmer 67 


Note.—In some cases occupational groups were sampled and tested in diff Thus numbers 
following samples indicate appropriate year, erent years (e.g., Banker, see Table 34) 


they were roughly 3-4 yr. beyond their final 


degree. 


3. The three largest gains for each sample, 
which frequently were not among the highest 
scores on either testing. 


Although, again, there is no one index to de- 
termine the predictive efficiency of these scales, 
the results are reassuringly meaningful. The 
scales that each group scored highest on, both 
at test and retest, are, with a few exceptions, 


: 
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TABLE 46 
MALE OCCUPATIONS MEAN SCORES ON THE NATURE SCALE 


M stand- Sample 
ard score 

70 

69 

68 

67 

66 

65 

64 

63 

62 

61 

60 

59 Farmer 36/ Animal Husbandry Prof 

58 Forest Service Men 

57 Vet 49 

56 Vet 66/ Biologist 

55 Orthopedic Surgeon 

54 Radiologist/ Urologist/ Physiatrist 

53 Pediatrician/ Surgeon/ Minister 27/ Physical Therapist/ Farmer 67/ Osteopath/ Pathologist/ 
Neurological Surgeon 

52 Public Admin/ YMCA Sec/ Dentist/ Physician 49/ Psychiatrist/ Artist/ Internist/ NIAL Nember 

51 YMCA PD/ Physician 27/ Comm Rec Director/ Generals and Admirals/ Math-Science Teacher 

50 Rehab Counselor/ Architect/ Chemist/ School Super 30/ Unitarian Minister/ Guidance Counselor/ 
Mpls Symphony/ School Super 65/ Anthropologist/ Governor 

49 Psychologist 47/ Soc Sci Teacher/ Social Worker 53/ Psychologist 31/ Credit Manager/ Psychologist 
(Exp)/ Personnel Director/ Carpenter/ Elem Teacher/ Policemen/ Librarian/ Pulitzer Prize/ Army 
Officer/ Banker 34/ Legislator/ CPA 44/ Psychologist 49/ Music Teacher/ Interior Decorator 

48 Pharmacist/ Sociologist/ Accountant/ Engineer/ McKnight Fellow/ Pilot/ Optometrist/ Lawyer 49/ 
Corp Pres 35/ Funeral Director/ Minister 65/ Physicist 27/ Bus Ed Teacher/ Musician/ Physicist 67/ 
Astronaut/ Mathematician/ Authors/ Student Personnel 

47 Priest/ Office Worker/ Salesmen, Life 31/ Salesmen, PG&E/ Printer/ Production Manager/ Photog- 
rapher/ Sales, Encyclopedia / Judge / YMCA Staff / Computer Programmer / Corp Pres 65/ Banker 64/ 
Social Worker 67/ Petroleum Engineer 

46 Salesmen, Real Estate / Economist / Sales Manager / Danforth Fellow / Navy Officer 

45 Cham Comm Exec/ Salesmen, Steel/ CPA 65/ Purch Agent/ Political Scientist/ Air Force Officer / 
Lawyer 27 Newsmen/ Advertising Men 

44 Salesmen, Life 66 

43 Salesmen, Auto/ Salesmen, Computer 

42 Salesmen, 3M 

41 Salesmen, 3M applicants 

40 

39 

38 

yk 

36 

35 


Note. In some cases occupational groups were 
following samples indicate appropriate year. 


those most related to their work. The excep- 
tions were almost all due to the Adventure 
scale; this was among the three highest scores 
for five of the samples at test, and remained 
among the highest for three at the retest. This 
personality scale just does not operate as do 
the vocational interests scales. Other than this 


sampled and tested in different years (e.g., Banker, see Table 34), Thus numbers 


scale, the other high scores were appropriate. 
The accountants scored highest on the busi- 
ness-oriented scales: Office Practices, Sales, 
and Merchandising; the dentists and physi- 
cians scored highest on the Medical Service 
and Science scales, and so forth. 

The patterns of high scores on both the test 
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TABLE 47 


MALE Occupations MEAN SCORES ON THE OFFICE Practices SCALE 


M stand- 
ard score 


70 
69 
68 
67 


65 Bus Ed Teacher 


61 Baker 34/ Banker 64 
59 Office Worker 


Sample 


58 Accountant/ Soc Sci Teacher/ Credit Manager 


55 Rehab Counselor/ Guidance Counselor/ Comm Rec Director/ Math-Science Teacher 
54 Salesmen, 3M/ YMCA Sec/ Salesmen, PG&E/ CPA 65/ CPA 44/ School Super 65 
53 Public Admin/ Purch Agent / Policemen / Legislator/ Funeral Director 

52 Salesmen, 3M applicants/ Pharmacist/ Farmer 67/ Governor 


51 Vet 66/ Cham Comm Exec/ YMCA PD 


46 Psychologist 47/ Sociologist / Radiologist / Pediatrician/ Unitarian Minister / Pilot/ Dentist / Mathe- 


matician 


45 Judge / Psychologist 31) Chemist / Lawyer 27/ Corp Pres 65/ Psychologist 49/ Musician/ Orthopedic 


Surgeon 

44 Biologist/ Economist/ Political Scientist 
Men 

43 Engineer/ Psychologi 
Neurological Surgeon 

42 Physician 27/ Surgeon/ Pathologist 


/ Mpls Symphony/ Physician 49/ Psychiatrist/ Advertising 


st (Exp)/ Newsmen/ Physicist 27/ Salesmen, Life 66/ Internist/ Astronaut/ 


41 Architect/ McKnight Fellow/ Pulitzer Prize/ Physicist 67 


40 Danforth Fellow/ Authors/ Anthropologist 


39 Photographer/ NIAL Member 
38 


37 Artist 
36 
35 


Note.—In some cases occupational groups were sam 
following samples indicate appropriate year, 


and retest administrations again make it very 
clear that the scales have a substantial rela- 
tionship to the individual’s occupational 
choice. 

The gains reported in Table 58 are worthy 
of note also as they indicate areas of interests 
showing the greatest increase over the college 


pled and tested in different years (e. g., Banker, see Table 34). Thus numbers 


years, irrespective of initial level of score. In 
general, the changes should please university 
faculties. The largest gain for all six groups was 
on the Teaching scale, ranging from 7 points 
among the lawyers to 10 points among the 
dentists, a full standard deviation. All of the 
groups also showed increased scores on the Art, 


- 
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Music, and Writing scales, areas particularly 
important to a liberal arts college faculty. 
Interestingly, this tendency was less pro- 
nounced among the accountants who were pri- 
marily students drawn from the Business 
School. However, the other non-liberal-arts 
sample—the mechanical engineers—showed 
the same increases as the other students. 

These increases are very similar to those 
found among the Stanford students in the re- 
liability study reported earlier, showing that 
interests in teaching and in cultural activities 
increase during the college years. The same 
pattern, though less pronounced, appeared in 
the Harvard test-retest profile in Table 32. 
These highly selected students had higher 
scores in these areas when initially tested, and 
their scores did not increase as much as the 
Minnesota and Stanford students. 


Salsemen and Scientists 


Further predictive validity information has 
been developed by studying two subsets of in- 
dividuals in the norm group. As these men have 
been tested twice, once in 1930 as 16-yr.-olds 
and again in 1966 as 52-yr.-olds, it has been 


SALESMEN 


SALES 


SCIENCE 


The test (age 16) and retest (age 52) SVIB 
basic scale profiles for these two groups are 
presented in Fig. 3. 

As the mean profiles indicate, the Sales and 
Science scales separated these groups fairly 
well at age 16, and by age 52 the difference had 
increased. At age 16, the differences were 
roughly 1 SD, increasing to about 1.5 or 2 SDs 
between the adults. The larger differences be- 
tween the adult samples were created because 
the salesmen increased more on the sales- 
oriented scales, the scientists on the science- 
oriented scales, not because either of them had 
decreasing scores. Whether these increases 


possible to study how their test results relate 
to their adult occupations. 

Of the 1,943 boys tested in 1930, 1,214 (63%) 
have been retested in 1966-67. Twenty-five 
percent could not be located currently, 4% 
refused to cooperate in this study, and 8% 
were deceased. (When these basic scales were 
normed, only 647 retests were available.) 

When retested as adults, these men were 
asked to fill in a short checklist describing 
their jobs. This checklist contained “semantic 
differential” type items, that is, pairs of state- 
ments reflecting opposite extremes, and each 
man was asked to indicate where his job fell on 
a line between these pairs. Two of the items 
were “sales versus nonsales“ and “science 
versus nonscience.“ From the total sample, two 
subgroups of individuals were identified, using 
their answers to these two items. The first 
included 101 men who marked the opposing 
extremes, sales, nonscience,” as descriptive of 
their jobs; the other sample included 42 men 
who marked the other extremes, science, non- 
sales.” The following diagram shows graphi- 
cally how these groups were selected and re- 
ports the number in each group: 


SCIENTIST 


> E Di NONSALES 


X X X NONSCIENCE 


came before or after the occupational ex- 
perience cannot be determined from these 
data; the latter seems more plausible. 

These mean differences on the basic scales, 
though large and statistically far beyond the 
chance level, were not as large as the differ- 
ences between these two samples on the regular 
SVIB occupational scales. For example, on the 
SVIB Chemist scale, the samples had the fol- 
lowing test and retest means—test: Scientist 
M = 39, Salesman M = 26, Dy = 13; retest: 
Scientist M = 43, Salesman M = 17, Dy = 26. 
On the SVIB Sales Manager scale, the samples 
had the following test and retest means test: 


Public Speaking 13 S . of Pe 

Law/Politics r ete foe A e 

Business Management EIR Ure, oes „ 

Sales 4, SN alata te ao sta ae ' 

Merchandising B E T a ‘ 

Office Practices ay eae al rae ore e ’ 

Military Activities eee 8 so . 3 e 

Technical Supervision er be er aA ee a ae TTET & = 

Mathematics Se ee ae Fy AED ip © [le Make kaart è aso 

Science a e T „ foes e 

Mechanical „ r astm © sole ae ie - 
„ ' 


Nature r mC Sees 


Agriculture ai oe sink eae Ce ew 


Adventure . = Sales Test 


Recreational Leadership ....'.... = Sales Retest 


—— 
—— 

ay ZA = Mean increases among salesmen 
oo 


Medical Service „e. 

Social Service F eee = Science Test 

Religious Activities e Aa at's .. @--@ = Science Retest 

Teaching E es a = Mean increases among scientists 
Music Bask n 1 $ ' 

Art SONG sep OE a A ama a ALLES a e 
Writing a „ 8 gS a fe ty Re sate 2 S = 


Fic. 3. Test and retest profiles for salesmen and scientist samples. 
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TABLE 4 
Mate Occerations Meax Scoars ow Pumtic SPEAKING SCALE 


Unitarian Minister / Minister 65 


Priest/ Cham Comm Exec/ Saleamen, Real Fatate/ Political Selentist/ Saleamen, Computer 
Saleamen, 3M applicants/ Minister 27/ School Super 65/ Saleamen, Life 66 

Salesmen, 3M/ Comm Ree Director/ Danforth Fellow/ Social Worker 67 

YMCA Sec/ Social Worker S/ YMCA Stafl/ Judge/ Newamen/ Student Personnel 

Sociologist/ Economist/ Elem Teacher/ Lawyer 49 

Soe Sei Teacher/ Saleamen, Steel/ Guidance Counselor/ Generals and Admirals/ Navy Officer/ 
Astronau 


t 
Rehab Counselor/ Credit Manager / School Super 30/ Air Force Officer/ McKnight Fellow/ Lawyer 27 
Public Admin / CPA 65/ Optometrist/ Corp Pres 65/ Music Teacher/ Bus Ed Teacher 

Salesmen, Life 31/ Librarian/ Pulitzer Prize/ Army Officer/ Physiatrist/ Animal Husbandry Prof 
Psychologist 47/ YMCA PD/ Salesmen, PG&E/ Personnel Director/ Computer Programmer / Phy- 
sical Therapist/ Psychologist 49/ Petroleum Engineer/ Ni Surgeon 

Vet 66/ Pediatrician / Sales Manager/ Paychiatrist/ Orthopedic Surgeon / Anthropologist 

Psychologist 31/ Funeral Director/ Biologist/ Urologist/ Saleamen, Auto/ Physicist 67/ Internist/ 
Math Science Teacher / NIAL Meml er 

Salesmen, Real Estate/ Accountant/ Radiologist/ Photographer / Psychologist (Exp)/ Surgeon/ 
Policemen / CPA 44/ Banker 64/ Ostcopath/ Authors/ Interior Decorator/ Advertising Men 

Office Worker/ Printer / Production Manager/ Purch Agent/ Mpls Symphony/ Banker 34/ Phy- 
sician 49/ Musician/Pa! 

Pharmacist/ Farmer 36/ Pilot/ Vet 49 

Physician 27/ Corp Pres 35/ Forest Service Men/ Dentist/ Mathematician 

Chemist/ Engineer/ Physicist 27 

Architect/ Carpenter 

Farmer 67 

Artist 


3882888888888 


So £2835 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


Salesman M = 35, Scientist M = 27,D = 8; science and sales components. In the Chemist 
r test: Salesman M = 41, Scientist M = 20, scale, these components are weighted posi- 
Du = 21. tively and negatively, respectively; in the 
These larger differences, exceeding 2 SDs Sales Manager scale, the reverse is true. Thus 
among the adult samples, can probably be the occupational scales take advantage of both 
traced to the relative heterogeneity of the components for maximum separation, while 
occupational scales compared to the basic the basic scales sacrifice some of this differ- 

scales. Each of these two occupational scales, entiation for easier interpretability. Note again 
_ Chemist and Sales Manager, contains both that there were differential increases (and, 
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TABLE 49 
MALE OCCUPATIONS MEAN SCORES ON THE RECREATIONAL LEADERSHIP SCALE 


M stand- 
ard score 


61 YMCA PD 


Sample 


— — —— — —— —— —Z—ͤ —— 


59 Salesmen, 3M applicant/ Comm Rec Director 


58 Salesmen, 3M/ YMCA Staff / Physical Therapist 


57 
56 

55 Policemen 

54 Salesmen, Steel / Elem Teacher / Salesmen, Auto/ Salesmen, Computer 


53 Soc Sci Teacher / Salesmen, Life 66/ Social Worker 67 


52 Credit Manager / CPA 65 / Pilot / Army Officer / Legislator/ Air Force Officer/ Math- Science Teacher 
51 YMCA Sec / Salesmen, PG&E/ Forest Service Men / School Super 65/ Osteopath / Petroleum Engi- 
neer / Orthopedic Surgeon / Astronaut / Vet 49 


50 Vet 66 Cham Comm Exec / Printer / Guidance Counselor / Urologist/ Computer Programmer / Gen- 


erals and Admirals / CPA 44/Newsmen/ Bus Ed Teacher 


49 Rehab Counselor/ Social Worker 53/ Office Worker / Salesmen, Life 31/ Personnel Director / Salesmen, 


cal Surgeon 


Encyclopedia / Optometrist / Dentist/ Navy Officer / Minister 65/ Banker 64/ Physiatrist/ Neurologi- 


48 Priest/ Salesmen, Real Estate/ Pharmacist/ Public Admin/ Accountant/ Radiologist/ Pediatrician/ 


Production Manager/ Carpenter / Farmer 36/ Surgeon/ Purch Agent/ Sales Manager/ Animal Hus- 
bandry Prof/ Farmer 67/ Student Personnel/ Governor 


47 School Super 30/ Unitarian Minister/ Lawyer 49/ Minister 27/ Funeral Director/ Lawyer 27/ Corp 


Pres 65/ Physician 49/ Psychiatrist/ Musician 


46 Psychologist 47/ Physician 27/ Chemist/ Banker 34/ Advertising Men 

45 Sociologist/ Engineer / Danforth Fellow / Corp Pres 35/ Music Teacher/ Internist 

44 Psychologist 31/ Photographer/ Mpls Symphony/ Judge/ Pathologist 

43 Architect / Biologist/ Economist / McKnight Fellow / Pulitzer Prize/ Psychologist 49/ Artist / Authors 
42 Psychologist (Exp)/ Political Scientist/ Librarian 


41 Physicist 67/ Anthropologist 
40 Physicist 27 


39 Mathematician/ Interior Decorator 
38 NIAL Member 

37 

36 

35 


Note.—In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


contrary to the findings on the basic scales, 
actual decreases on the nonappropriate scale) 
on these scales for these two differing samples. 


Construct Validity, Interest Constellations, and 
Implications 


All three types of validity information re- 
ported here, content, concurrent, and predic- 


tive, support the conclusion that these scores 
are related to occupational choice. Collectively, 
they also argue for the existence of the fourth 
type of validity that psychologists have found 
useful: construct validity, Construct validity 
refers to the concepts that underlie test be- 
havior, concepts which may not be directly ac- 
cessible for measurement but which provide 
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TABLE 50 
MALE OCCUPATIONS MEAN Scores on RELIGIOUS ACTIVITIES SCALE 


M stand- 
ard score 
— — — —— ——Uy.Wn...ů——ů —— 


Sample 


71 YMCA Sec / Minister 27 
70 Minister 65 
69 YMCA Staff 


67 Priest 
66 YMCA PD 


64 Unitarian Minister 


61 Comm Rec Director/ Legislator 

60 Rehab Counselor/ Phsyical Therapist 

59 Salesmen, 3M/ Cham Comm Exec/ Social Worker 53/ Guidance Counselor/ School Super 65/ 
Student Personnel 

58 Salesmen, 3M applicant/ Credit Manager / Salesmen, Encyclopedia/ Elem Teacher / Danforth Fellow/ 

Salesmen, Life 66/ Social Worker 67/ Music Teacher 

57 Soc Sei Teacher 

56 Salesmen, Steel/ Optometrist/ Banker 64/ Physiatrist/ Bus Ed Teacher 

55 Sociologist/ School Super 30/ CPA 65/ Funeral Director/ Farmer 67 

54 Vet 66/ Librarian/ Salesmen, Computer/ Math-Science Teacher/ Governor 

53 Computer Programmer/ Judge/ Lawyer 49/ Air Force Officer / Animal Husbandry Prof / Petroleum 

Engineer 

52 Salesmen, Life 31/ Policemen/ Mpls Symphony/ Navy Officer/ Astronaut/ Vet 49 

51 Public Admin/ Radiologist/ Office Worker/ Personnel Director/ Salesmen, Auto/ Banker 34/ Polit- 

ical Scientist/ Newsmen/ Generals and Admirals/ Orthopedic Surgeon/ Neurological Surgeon 

50 Pharmacist/ Accountant/ Pediatrician/ Biologist/ Urologist/ Economist/ Pulitzer Prize/ Army 

Officer/ CPA 44/ Corp Pres 65/ Musician/ Osteopath 

49 Psychologist 47/ Salesmen, PG&E/ Printer/ Production Manager/ Farmer 36/ McKnight Fellow/ 

Sales Manager/ Physician 49/ Interior Decorator 

48 Carpenter/ Purch Agent/ Corp Pres 35/ Psychologist 49/ Mathematician/ NIAL Member 

47 Salesmen, Real Estate/ Engineer/ Photographer/ Surgeon/ Forest Service Men/ Dentist/ Lawyer 27/ 

Physicist 27/ Psychiatrist 

46 Pilot/ Physicist 67/ Pathologist/ Internist 

45 Architect/ Physician 27/ Chemist/ Authors/ Anthropologist/ Advertising Men 

44 Psychologist 31/ Psychologist (Exp) 

43 Artist 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34), Thus numbers 
following samples indicate appropriate year. 


some explanatory power for the parallel and to suggest a mechanism to explain an indi- 
features of test responses and actual behavior. vidual's answers to these items. 

What follows here is a brief attempt to make The underlying concepts here are the related 
psychological sense of these clusters of interests constellations of interests. Somehow these con- 
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TABLE 51 
MALE OCCUPATIONS MEAN SCORES ON THE SALES SCALE 
M stand- Sample 
ard score 
70 
69 
68 
67 Salesmen, 3M applicants 


66 Salesmen, 3M 


64 Salesmen, PG&E/ Salesmen, Encyclopedia 


63 Salesmen, Auto/ Salesmen, Computer 


61 Salesmen, Steel / Salesmen, Life 31/ Salesmen, Life 66 


59 Salesmen, Real Estate/ Sales Manager/ Bus Ed Teacher 


56 YMCA Sec/ Credit Manager/ Legislator 


55 Pharmacist/ Cham Comm Exec/ Funeral Director 


54 Office Worker/ Guidance Counselor 


53 Rehab Counselor/ Soc Sci Teacher/ YMCA Staff/ School Super 65 


52 YMCA PD/ Corp Pres 35/ Air Force Officer 


vertising Men/ Banker 64/ Farmer 67 


/ Purch Agent / Comm Rec Director / Banker 34/ Ad- 


51 Vet 66/ Corp Pres 65/ Petroleum Engineer/ Vet 49 

50 Accountant/ Social Worker 53/ CPA 65/ Production Manager/ Personnel Director/ Carpenter/ Op- 
tometrist/ Physical Therapist/ Music Teacher/ Interior Decorator/ Math-Science Teacher/ Governor 

49 School Super 30/ Farmer 36/ Elem Teacher / Policemen/ Minister 27/ Navy Officer / Minister 65 

48 Priest / Printer / Army Officer / CPA 44 / Physiatrist/ Social Worker 67/ Osteopath 

47 Public Admin/ Radiologist/ Unitarian Minister/ Urologist/ Forest Service Men/ Pilot/ Computer 
Programmer/ Lawyer 49/ Animal Husbandry Prof/ Musician/ Orthopedic Surgeon 

46 Psychologist 47/ Dentist/ Generals and Admirals/ Astronaut/ Neurological Surgeon/ Student Per- 


sonnel 


45 Pediatrician/ Chemist/ Engineer/ Surgeon / Librarian/ Physician 49/ Psychiatrist/ Psychologist 49 
H Sociologist/ Mpls Symphony/ Judge/ Newsmen/ Lawyer 27/ Internist 

43 Architect/ Psychologist 31/ Physician 27/ Biologist/ Economist/ Photographer 

42 Psychologist (Exp)/ McKnight Fellow/ Political Scientist/ Danforth Fellow/ Pathologist/ Mathe- 


matician 


41 Pulitzer Prize/ Physicist 27/ Physicist 67/ Artist/ Authors 


40 Anthropologist 
39 NIAL Member 


Note.—In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


stellations dictate, at least to some degree, the 
individual’s feeling toward activities which 
fall within the bounds of the constellation. It 
might be fruitful in future research to consider 
an individual’s answer to a specific item (and 
thus his feeling toward a specific behavior) as 
a combination of his perception of which con 
stellation an item belongs to and of his attrac- 
tion toward that constellation generally. Thus, 
the two salient variables are his evaluation of 
constellation membership for an item and his 


attraction for the activities within the constel- 
lation—in a convenient shorthand, Perception 
Attraction. 

A clarifying example: A salesman would 
answer Like to the item “Sell Fuller brushes 
door-to-door” because he would perceive that 
as a sales item and he likes sales activities; 4 
scientist would mark it Dislike because he also 
perceives it as a sales item and dislikes those 
activities. But if confronted with the item, 
“Work for the National Academy of Sciences, 
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TABLE 52 
MALE Occupations MEAN SCORES ON THE SCIENCE SCALE 


M stand- 
ard score 


62 Physicist 27/ Physicist 67 
61 Chemist 
60 Biologist/ Psychologist (Exp)/ Pathologist 


Sample 


eee 


59 Psychologist 31/ Animal Husbandry Professor/ Neurological Surgeon 
58 Pediatrician/ Urologist/ Internist/ Mathematician/ Math-Science Teacher 
57 Radiologist/ Engineer/ Air Force Officer/ Physiatrist/ Psychiatrist/ Psychologist 49/ Orthopedic 


Surgeon/ Astronaut 
56 Psychologist 47/ Surgeon/ Anthropologist 


55 Physician 27/ Economist/ Optometrist/ Computer Programmer/ Danforth Fellow/ Physician 49/ 


Physical Therapist/ Petroleum Engineer 
54 Army Officer 


53 Pilot/ Dentist/ Generals and Admirals/ Osteopath 
52 Vet 66/ Pharmacist/ Public Admin / Architect/ Sociologist/ School Super 30/ Mpls Symphony/ Sales- 


men, Computer/ CPA 44/ Navy Officer 


51 Rehab Counselor/ YMCA PD/ Unitarian Minister/ Production Manager/ Personnel Director/ Elem 
Teacher/ Librarian/ Minister 27/ NIAL Member/ Student Personnel 

50 Accountant/ Guidance Counselor/ Corp Pres 35/ Farmer 36/ Forest Service Men/ Political Scientist/ 
Policemen/ School Super 65/ Corp Pres 65/ Musician/ Artist/ Vet 49 

49 YMCA Sec / Social Worker 53/ Credit Manager / Salesmen, PG&E/ Printer / Photographer/ Judge / 


Laywer 49 


48 Salesmen, 3 M/ Priest / Salesmen, Steel / Office Worker / CPA 65 / Purch Agent / Pulitzer Prize / Comm 
Rec Director / Lawyer 27/ Music Teacher / Bus Ed Teacher 
47 Soc Science Teacher / Carpenter McKnight Fellow / Sales Manager / YMCA Staff / Legislator/ 


Minister 65/ Social Worker 67/ Governor 


46 Salesmen, 3M applicant/ Cham Comm Exec/ Advertising Men/ Banker 64/ Author 
45 Salesmen, Encyclopedia/ Salesmen, Auto/ Newsmen 
44 Salesmen, Life 31/ Funeral Director/ Banker 34 


43 Salesmen, Real Estate 


42 Salesmen, Life 66/ Farmer 67/ Interior Decorator 


Note. In some cases occupational groups were sampled and tested in different years (e,g., Banker, see;Table 34). Thus numbers 


following samples indicate appropriate year. 


appearing before Congress, private founda- 
tions, and other prestige organizations, selling 
the idea that science must be broadly sup- 
ported by our society,” they both might 
answer Like as one would perceive it as a sales 
activity, the other as science. In this illustra- 
tion, their perceptions might be modified sub- 
stantially by substituting other verbs for 


“selling,” such as persuading or, better yet, 
educating or informing. 

This formulation of organizing constellations 
as the constructs underlying interest measure- 
ment is undoubtedly too simple, and only par- 
tially correct. Yet it does provide a way to 
begin to understand more than the empirical 
facts, to lead us out of the wilderness of means 
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TABLE 53 


MALE OCCUPATIONS MEAN SCORES ON THE SOCIAL SERVICE SCALE 


M stand- 


ard score Sample 
70 
69 YMCA Sec 
68 
67 Social Worker 53/ YMCA Staff/ Minister 65 
66 Social Worker 67 
65 YMCA PD/ Minister 27 
ot 
63 Rehab Counselor/ Priest/ Unitarian Minister/ Guidance Counselor 


62 Soc Sci Teacher 
61 Psychologist 47/ Comm Rec Director / Student Personnel 
60 Salesmen, 3M/ Legislator/ Physical Therapist 


59 Salesmen, 3M applicants/ Sociologist/ Elem Teacher/ Danforth Fellow/ School Super 65/ Music 


Teacher 
58 Salesmen, Encyclopedia/ Bus Ed Techer 
57 School Super 30 


56 Cham Comm Exec / Credit Manager / Librarian / Physiatrist / Salesmen, Life 66/ Math-Science 


Teacher / Governor 


55 Pediatrician/ Salesmen, Life 31 / Policemen / Political Scientist / Mpls Symphony / Funeral Director / 


Psychiatrist 


54 Pharmacist / Public Admin / Salesmen, Steel / Personnel Director / Optometrist / Lawyer 49/ Psychol- 


ogist 49/ Musician / Vet 49 


53 Vet 66/ Salesmen, PG&E/ CPA 65/ Economist/ McKnight Fellow/ Newsmen/ Banker 64/ Farmer 


67/ Osteopath 


52 Judge/ Office Worker/ Printer/ Carpenter/ Salesmen, Auto/ Banker 34/ Salesmen, Computer/ 


CPA 44/ Internist 


51 Accountant/ Radiologist/ Urologist/ Photographer/ Forest Service Men/ Computer Programmer/ 
Corp Pres 35/ Air Force Officer/ Physician 49/ Orthopedic Surgeon/ Interior Decorator/ NIAL 


Member 


50 Salesmen, Real Estate/ Production Manager/ Psychologist (Exp)/ Farmer 36/ Sales Manager/ 


Pulitzer Prize / Lawyer 27/ Corp Pres 65/ Anthropologist/ N eurological Surgeon 


49 Psychologist 31/ Physician 27/ Biologist/ Surgeon/ Purch Agent/ Dentist/ Navy Officer/ Animal 


Husbandry Prof/ Petroleum Engineer/ Pathologist/ Advertising Men 
48 Army Officer/ Mathematician 


47 Architect/ Chemist/ Engineer/ Pilot/ Generals and Admirals/ Physicist 27/ Physicist 67/ Astro- 


naut/ Authors 
46 Artist 


Note.—In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


and correlations, and it has some immediate agreement as to what an item means should be 
implications for the techniques of measurement: very high. This can be achieved in several 
ways: by using short familiar items; by using 
longer very specific items; or by using powerful 


1. If item perception is important, then item 
clarity is a paramount concern. Interindividual stereotypes such as occupational titles. 
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TABLE 54 
MALE OCCUPATIONS MEAN SCORES ON THE TEACHING SCALE 


rr T 


M stand- 
ard score Sample 

SS — a — . m 

70 

69 

68 

67 

66 Music Teacher 

65 

64 

63 Elem Teacher 

62 Librarian / Danforth Fellow/ Minister 65/ School Super 65 

61 School Super 30/ Unitarian Minister/ Guidance Counselor/ Minister 27/ Bus Ed Teacher/ Student 
Personnel 

60 Rehab Counselor/ Soc Sci Teacher/ Social Worker 67 

59 Priest / Sociologist / Mpls Symphony / Comm Rec Director/ YMCA Staff / Physiatrist 

58 Social Worker 53 / Biologist / McKnight Fellow / Political Scientist / Mathematician/ Math. Science 
Teacher / Neurological Surgeon 

57 Psychologist 47/ Psychologist 31/ Pediatrician / Legislator / Physical Therapist / Psychologist 49 

56 YMCA Sec/ Economist/ Psychiatrist/ Animal Husbandry Prof/ Musician/ Physicist 67/ Pathologist/ 
Internist/ Anthropologist 

55 YMCA PD/ Psychologist (Exp)/ Judge/ Salesmen, Encyclopedia/ Optometrist/ Pulitzer Prize/ 
Generals and Admirals/ Navy Officer/ Physicist 27/ Orthopedic Surgeon/ NIAL Member 

54 Salesmen, 3M applicant/ Cham Comm Exec/ Credit Manager/ Air Force Officer/ Surgeon/ Computer 
Programmer/ Lawyer 49/ Salesmen, Computer/ Newsmen 

53 Salesmen, 3M/ Radiologist/ Photographer/ Army Officer/ Salesmen, Life 66/ Astronaut 

52 Vet 66/ CPA 65/ Urologist / Personnel Director / Corp Pres 65/ Physician 49/ Governor 

51 Public Admin / Chemist / Petroleum Engineer / Interior Decorator 

50 Physician 27 / Salesmen, Steel / CPA 44 

49 Accountant/ Lawyer 27/ Banker 64/ Osteopath 

48 Salesmen, PG&E/ Printer/ Policemen/ Dentist/ Vet 49 

47 Pharmacist/ Architect/ Office Worker/ Salesmen, Life 31/ Production Manager/ Advertising Men/ 
Authors 

46 Engineer/ Corp Pres 35/ Carpenter/ Farmer 36/ Salesmen, Auto/ Forest Service Men/ Pilot/ Artist 

45 Sales Manager 

44 Funeral Director/ Banker 34/ Farmer 67 

43 Salesmen, Real Estate/ Purch Agent 

42 

41 

40 

39 

38 

37 

36 

35 


Note. — In some cases occupational groups were sampled and tested in different years (e. g., Banker, see Table 34), Thus numbers 
following samples indicate appropriate year. 


2. It suggests that forced-choice items are 


method is used, it should be minimized. 


too complicated. To ask a person to sort out 
his perceptions of three items in a triad, then 
to determine how he feels about each of these 
and compare those feelings is creating an un- 
necessarily complex task for him. Worse, it 
means the same pattern of answers could result 
from several different psychological viewpoints. 
While that is inevitable, no matter what 


(Forced-choice items may have some ad- 
vantages in other contexts—but they have yet 
to be demonstrated empirically.) 

3. The Perception X Attraction theory em- 
phasizes that, to this point in history, little 
attention has been paid to the perception por- 
tion in interest measurement—all of the em- 
phasis has been placed on the aifraction aspect 
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TABLE 55 
MALE Occupations MEAN SCORES ON TECHNICAL SUPERVISION SCALE 


M stand- 
ard score 


70 
69 


57 Air Force Officer/ Petroleum Engineer 
56 Production Manager 
55 Chemist/ Army Officer 


Sample 


54 Engineer/ Carpenter/ Pilot/ Salesmen, Computer 
53 Personnel Director/ Navy Officer/ Orthopedic Surgeon/ Astronaut 


51 Public Admin / Radiologist / Salesmen, Steel/ Salesmen, PG&E/ Corp Pres 35/ Urologist/ Computer 
Programmer/ Corp Pres 65/ Math-Science Teacher 


50 Salesmen, 3M applicant / Salesmen, 3M/ Purch Agent/ Policemen/ Generals and Admirals/ CPA 44/ 


45 Architect/ Biologist/ Student Personnel 


44 Priest/ Sociologist/ Physician 27/ Salesmen, 


Minister 65/ Musician/ Mathematician 
43 Salesmen, Real Estate / S 
42 Music Teacher / Governor 


Encyclopedia / Librarian / Banker 34/ Minister 27/ 


chool Super 30/ Political Scientist/ Mpls Symphony/ Anthropologist 


41 Salesmen, Life 31/ Photographer/ Lawyer 27/ Advertising Men 
40 Danforth Fellow/ N. ewsmen/ Salesmen, Life 66/ Interior Decorator 


39 Judge/ Artist 
38 McKnight Fellow/ NIAL Member 
37 Pulitzer Prize/ Authors 


Note.—In some cases occupational groups were sampled and tested in different years (e. g., Banker, 


following samples indicate appropriate year. 


by requiring the Like, Indifferent, or Dislike 
response. More should be done on perception, 
at least enough to determine if this way of 
thinking has any merit. 

4. If the perception factor does prove useful, 
it might help explain the changes over time 
within a single individual as it would allow for 
two types of changes, one in perception, the 


see Table 34), Thus numbers 


other in actual attraction to the activities. 
Such a possibility is particularly appealing 
when studying teenage-to-adult samples as 
some of their differences may be traced to the 
teenagers’ difficulty with the vocabulary. Per- 
ception certainly includes within it under- 
standing; if a person does not understand an 
item, his perception will be affected. When 
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TABLE 56 
Mate Occupations MEAN SCORES ON THE WRITING SCALE 


. ² AA 


M stand- 
ard score 


70 
69 
68 
67 


Sample 


64 Unitarian Minister/ Librarian/ Pulitzer Prize/ Newsmen 


63 McKnight Fellow/ Authors 


62 Political Scientist/ Danforth Fellow/ NIAL Member 


61 Minister 65 


60 Psychologist 47/ Judge/ Photographer/ Minister 27/ Sociologist 
59 Anthropologist/ Neurological Surgeon/ Social Worker 63 
58 Economist/ Elem Teacher/ Legislator/ Social Worker 67/ Music Teacher/ Student Personnel/ Ad- 


vertising Men/ Cham Comm Exec/ Psychologist 31 


57 Rehab Counselor/ Priest/ Mpls Symphony/ Lawyer 49/ School Super 65/ Physiatrist/ Psychiatrist/ 


Psychologist 49/ Governor/ YMCA Sec/ Pediatrician 


56 School Super 30/ Psychologist (Exp)/ Salesmen, Encyclopedia/ Lawyer 27/ Musician 
55 Salesmen, 3M applicants/ Salesmen, 3M/ Printer/ Guidance Counselor/ Biologist/ Personnel Di- 


orator/ Public Admin/ Soc Sci Teacher 


rector / Comm Rec Director/ Army Officer / Generals and Admirals / Artist / Internist / Interior Dec- 


54 Credit Manager / YMCA Staff / Salesmen, Life 66/ Bus Ed Teacher / Physicist 67/ Orthopedic Sur- 


geon / Radiologist 


53 Urologist / Surgeon / Computer Programmer / Salesmen, Computer / Navy Officer / Physical Thera- 


pist / Pathologist / YMCA PD 


52 Salesmen, Steel / Office Worker / Salesmen, Life 31/ Salesmen, PG&E/ Air Force Officer / Optometrist/ 


CPA 44/ Physician 49/ Astronaut / Mathematician / Architect 


51 Corp Pres 35/ CPA 65/ Sales Manager/ Animal Husbandry Prof/ Accountant/ Chemist 
50 Forest Service Men/ Policemen/ Pilot/ Corp Pres 65/ Osteopath/ Petroleum Engineer/ Math-Science 


Teacher/ Salesmen, Real Estate/ Physician 27 


49 Engineer / Production Manager / Salesmen, Auto / Dentist/ Banker 34/ Physicist 27/ Pharmacist 
48 Vet 66 / Funeral Director / Farmer 36/ Purch Agent / Banker 64 


47 Carpenter 
46 Vet 49 


43 Farmer 67 


Jote. In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34), Thus numbers 


following samples indicate appropriate year. 


understanding comes, his answer may change 
even though his attraction to that activity has 
not. 


This brief theoretical side trip illustrates one 
possibility of further research in clustering 
interests, and in understanding the results. 
After 40 yr. of fact gathering, we must have 


some organizing theory, especially since the 
computer has made it possible to carry empiri- 
cism to ridiculous extremes. 


USE OF THE Basic SCALES 


This paper has been almost solely concerned 
with the construction and evaluation of scales 


TABLE 57 
MEAN Test AND RETEST PROFILES FOR THE BERDIE-SCHLETZER CURRICULAR Groups 


—== 
Accou: i ournalists Mechanical Engi- Physicians 
Total sample 20 Dentists (N = 30) N 5 Lawyers (N= 32) e (N 20 (N 26) 
5 Test | Retest M M M M M M 
* E 0 c C c 
M| SD | M| SD Test | Retest Test | Retest Test | Retest Test | Retest Test | Retest Test | Retest 
pe AE Sei 3 — 
1 Public Speaking 53 | 10.8 | 57 | 11.0 | .67 | +4 52 55 46 47 55 61 +6 | 63 65 50 54 +4 | 54 59 
2 Law/Politics 53 | 10.6 | 57 10.67% +4 | 55 | 60 | +5] 47 50 54 59 | +5] 6t | 66 | +5] 50 | 55 | +5] 52] 55 
3 Business Manage- 
ment 49 | 10.3 | 50 | 10.5 | .46 54 56 46 43 51 53 56 54 46 51 +5 45 43 
4 Sales 53 | 10.1 | 48 | 9.5] 38] —5 58 50 =8 50 44 —6 58 54 —4 58 49 — 50 49 48 42 
5 Merchandising 50 | 9.9 | 50 | 10.4 | 52 56 56 47 45 52 54 56 52 -4 47 50 46 43 
6 Office Practices 51 | 10.2 | 46 | 9.9] .56 | —5 61 55 —6 50 43 gi 49 47 53 48 — 47 45 46 42 
7 Military Activities 53 9.0 50 9.443 53 48 52 52 53 51 55 52 53 51 50 40 
8 Technical Super- 
vision 48 9.6 | 50 | 11.7 | .37 46 50 +4 | 48 47 46 42 —4 | 47 45 53 61 +8 | 46 48 
9 Mathematics 53 8.6 |54| 9.6 | .69 54 57 53 53 44 41 49 50 59 60 57 59 
10 Science 52| 9.553 9.3 | .60 48 50 54 55 46 45 46 47 58 58 60 60 
11 Mechanical 48 | 9.9 |48 | 10.9 | .72 45 46 49 49 45 41 —4 42 42 57 58 45 47 
12 Nature 42 9.8 | 45 | 10.6 | .64 38 39 44 49 +5 41 40 39 41 44 47 40 50 
13 Agriculture 43 | 10.0 | 44 | 10.1 | .53 40 40 43 46 42 40 40 41 47 47 4s 44 
14 Adventure 58 | 10.5 | 57 | 11.0 | .47 56 53 59 56 57 59 59 56 62 61 55 53 
15 Recreational 
Leadership 52 9.8 | 50 | 9.8] .65 55 52 53 53 51 49 53 49 —4 51 53 46 47 
16 Medical Service 52] 9.8] 55 | 10.5 | 56 48 50 57 63 +6 48 49 50 50 49 52 62 64 
17 Social Service 52 | 9.6 | 52| 9.7 44 55 54 50 50 53 56 53 53 48 49 55 51 
18 Religious Activities |52 10.4 | 53 | 11.3 | .38 52 53 50 52 54 55 54 52 50 53 54 56 
19 Teaching 45| 9.3 538.336 +8 46 54 +8 42 52 +10 47 55 +8 46 53 +7 42 51 +9 46 55 
20 Music 46 | 10.3 | 51 | 9.4] .55 48 51 44 48 +4 | 48 52 +4 | 50 53 41 49 +8 | 48 56 
21 Art 45 8.5 50 9.8] .57 +5 43 46 45 51 +6 48 55 +7 45 49 +4 44 49 +5 45 51 
22 Writing 49 | 9.2 |53 | 10.1 | .63 | +4 50 41 43 45 58 64 +6 56 60 +4 44 49 +5 48 54 
Mdn 56 
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* Mean changes larger than 4 points. 


TABLE 58 


THREE HIGHEST TEST AND RETEST MEANS AND LARGEST GAINS FOR BERDIE-SCHLETZER Six CURRICULAR GROUPS 


a a aall 


Accountants M Dentists M Journalists M Lawyers M | Mechanical Engineers} M | Physicians | M 

Test 

Office Practices 6¹ Adventure 59 Writing 58 Public Speaking 63 Adventure | 62 Medical Service | 62 

Sales 58 Medical Service | 57 Sales 58 Law/Politics 61 Mathematics 59 Science 60 

Merchandising 56 Science 54 Adventure 57 Adventure 59 Science 58 Mathematics 57 

Adventure 56 | 
Retest 

Law/Politics 60 Medical Service | 65 Writing 64 Law/Politics 66 Technical Supervision | 61 Medical Service | 64 

Mathematics 57 Adventure 56 Public Speaking | 61 Public Speaking | 65 Adventure 61 Science 60 

Business Management | 56 Science 55 Law / Politics 50 Writing 60 Mathematics 60 Public Speaking 80 

Merchandising 56 Adventure 59 Mathematics 59 
Gains 

Teaching 46-54 | Teaching 42-52 | Teaching 47-55 | Teaching 45-53 | Teaching | 42-51 | Teaching 46-55 

Law/Politics 55-60 | Medical Service | 57-63 | Art 48-55 | Law/Politics 61-66 | Technical Supervision | 53-61 | Music 48-56 

Technical Supervision | 46-50 | Art 45-51 | Writing 58-64 | Art 45-49 | Music 41-49 | Art 45-51 

Public Speaking | 55-61 | Writing 56-60 Writing 48-54 
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to measure specific areas of vocational interests. 
An important next step is to learn something 
about the application of these scales. 


For Counseling 


Counseling textbooks abound with view- 
points as to how test information should be 
used with individual clients. Research on these 
viewpoints is much less common, and there are 
very few data to support answers to even the 
most obvious questions: “It costs about one 
dollar to administer and score a student’s 
Strong—is it worth it?” or “Should an indi- 
vidual be shown his scores?” In general, 
opinions here vary inversely in intensity with 
the amount of available relevant information. 


For Selection 


The situation with the use of interest in- 
ventories in employee selection is not greatly 
different—the basic questions on how to apply 
the methods have not been answered, and the 
only acceptable applications currently depend 
on the presence of a skilled, well-trained 
professional. 

While much more research on the effects of 
various approaches is necessary, in the absence 
of such information the authors suggest the 
following guideline: 


These scales are reasonably accurate measures of an 
individual’s interest in the specified areas. Anyone con- 
fronted with a choice of any of these activities, whether 
for training or employment, would be aided by this 
relevant information about himself, Professional psy- 
chologists faced with selection or placement decisions 
should find the results helpful in determining what acti- 
vities an individual might find satisfying. 

Though the basic scales should be useful in 
evaluating an individual’s responses to the 
SVIB, they are intended to supplement, not 
supplant, the regular occupational scales. The 
regular scales still permit greater diversity and 
complexity in profile patterns than do the basic 

scales, as the unique items remaining in the 
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SVIB, not included in any of the basic scales, 
are important. It would be a serious mistake 
to restrict measurements to these 22 factors in 
the belief that the valid variance in the interest 
domain has been covered, or even the valid 
variance in the SVIB item pool. Further re- 
search must push ahead to find other ap- 
proaches, always with the intent of finding 
powerful yet parsimonious explanations of the 
organization of an individual’s choices, 
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